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NOVEL ANTIGEN CONSTRUCTS USEFUL IN THE 
DETECTION AND DIFFERENTIATION OF ANTIBODIES TO HTV 

Background of the Invention 
5 This invention relates generally to immunoassays for the detection and differentiation of 

antibodies to Human Immunodeficiency Virus Type 1 (HIV-1) Group M, HIV-1 Group O and 
Human Immunodeficiency Virus Type 2 (HIV-2). More particularly, the invention relates to 
novel antigen constructs useful as reagents in such assays, as well as polynucleotides, DNA 
clones, expression vectors, transformed host cells and the like which are useful in the 

1 0 preparation of such antigens. 

Detection of HIV infection in a patient, and characterization of the viral type, are 
typically carried out using immunoassays which rely on the highly specific interaction between 
antigens used as reagents in the assay and circulating antibodies in the patient's serum. The 
immunoreactivity of patient antibodies with some antigens, and to a lesser extent or not at all 

1 5 with others, permits the identification of the type and subtype of the HIV which is present 
Currently, there are two major phylogenetic groups of HIV-1 designated as Groups 
"M" and "O." G. Meyers et al., Human Retroviruses and AIDS 1995, Los Alamos National 
Laboratory, Los Alamos, NM (1995)* HIV-1 Group M isolates further have been divided into 
subgroups (A to J) that are phylogenetically approximately equidistant from each other. Group 

20 M isolates predominate worldwide. The earliest reports about the sequence of HIV-1 Group O 
indicated that these viruses were as closely related to a chimpanzee virus as to other HIV-1 
subgroups. See, for example, L.G. Giirtler et aL, 7. Virology 68:1581-1585 (1994); M. 
Vanden Haesevelde et al., J. Virology 68:1586-1596 (1994); De Leys et al., Virology 
64:1207-1216 (1990); DeLeys et aL, US. Patent No. 5,304,466; L.G* Giirtler et al., 

25 European Patent Publication No. 591914 A2, The Group O sequences are the most divergent 
of the HIV-1 sequences described to date. Although HIV-1 Group O strains are endemic to 
west central Africa (Cameroon, Equatorial Guinea, Nigeria and Gabon), patients infected with 
Group O isolates now have been identified in Belgium, France, Germany, Spain and the 
United States. See, for example, R. DeLeys et al., supra; P- Charneau et aL, Virology 

30 205:247-253 (1994); L Loussert-Ajaka et aL, /. Virology 69:5640-5649 (1995); H. Hampl et 
aL, Infection 23:369-370 (1995); A. Mas et aL, AIDS Res. Hum. Retroviruses 12:1647-1649 
(1996); M. Peters et al., AIDS 11:493-498 (1997); and M.A. Rayfield et aL, Emerging 
Infectious Diseases 2:209-212 (1996). 

HIV-1 Group M serology is characterized in large part by the amino acid sequences of 

35 the expressed viral proteins (antigens), particularly those comprising the core and envelope 
(env) regions. As between various strains of this rapidly-mutating virus, these antigens are 
structurally and functionally similar but have divergent amino acid sequences which elicit 
antibodies that are similar but not identical in their specificity for a particular antigen. 
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One of the key serological targets for detection of HIV-1 infection is the 41,000 MW 
transmembrane protein (IMP), glycoprotein 41 (gp41). gp41 is a highly immunogenic protein 
which elicits a strong and sustained antibody response in individuals considered seropositive 
for HIV. Antibodies to this protein are among the first to appear at seroconversion. The 
5 immune response to gp41 apparently remains relatively strong throughout the course of the 
disease, as evidenced by the near universal presence of anti-gp41 antibodies in asymptomatic 
patients as well as those exhibiting clinical stages of AIDS. A significant proportion of the 
antibody response to gp41 is directed toward a well-characterized immunodominant region 
(IDR) within gp41. 

10 Infections with HIV Type 2 (HIV-2), a virus initially found in individuals from Africa, 

now have been identified in humans outside of the initial endemic area of West Africa, and 
have been reported in Europeans who have lived in West Africa or those who have had sexual 
relations with individuals from this region. See, for example, A.G. Saimot et al., Lancet i:688 
(1987); M. A. Rey et al., Lancet i:388-389 (1987); A. Werner et aL, Lancet i:868-869 (1987); 

15 G. Brucker et al., Lancet i:223 (1987); K. Marquart et aL, AIDS 2: 141 (1988); CDC, MMWR 
37:33-35 (1987); Anonymous, Nature 332:295 (1988). Cases of AIDS due to HIV-2 have 
been documented world-wide. Serologic studies indicate that while HIV-1 and HIV-2 share 
multiple common epitopes in their core antigens, the envelope glycoproteins of these two 
viruses are much less cross-reactive. F. Clavel, AIDS 1:135-140 (1987). This limited cross- 

20 reactivity of the envelope antigens is believed to explain why currently available serologic 

assays for HIV-1 may fail to react with certain sera from individuals with antibody to HIV-2. 
R Denis et al., /. Clin. Micro. 26:1000-1004 (1988). Recently-issued U.S. Patent No. 
5,055,391 maps the HIV-2 genome and provides assays to detect the virus. 

These viral strains are, for the most part, readily identified and characterized using 

25 commercially-available diagnostic tests. However, concerns have arisen regarding the 

capability of currently-available immunoassays, designed for the detection of antibody to HIV- 
1 (Group M) and/or HIV-2, to detect the presence of antibody to HIV-1 Group O. I Loussert- 
Ajaka et al., Lancet 343:1393-1394 (1994); C.A. Schable et al., Lancet 344:1333-1334 (1994); 
L. Giirtler et al., /. ViroL Methods 51:177-184 (1995). Although, to date, few patients 

30 outside of west Central Africa have been found to be infected with HIV-1 Group O isolates, 
health officials fear the emergence of this subtype in other geographic areas as well. 

Consequently, there is a continued need for new antigens, suitable for use in 
immunoassays, which alone or in conjunction with other antigens permit the recognition of all 
HIV-1 (Group M and Group O) and HIV-2 isolates and/or infections. 
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Summary of the Invention 

It has now been found that certain polypeptides or combinations of are particularly 
useful in the detection of HIV-1 Group O and other HIV infections. Consequently, in a first 
aspect of the present invention is disclosed an isolated HIV-1 Group O env polypeptide having 
5 an amino acid sequence consisting essentially of the sequence of Figure 1 (SEQ ID NO:61), 
representing the full-length env region of the HIV-1 Group O isolate HAM112. Similarly 
disclosed is an isolated HIV-1 Group O env polypeptide comprising an immunoreactive portion 
of the above full-length polypeptide, as well as polynucleotides encoding such polypeptides. 
In a second aspect of the present invention, an antigen construct is disclosed which 

10 comprises a first HIV-1 Group O env polypeptide fused to a second HIV-1 Group O env 
polypeptide. Preferably, the first polypeptide of such an antigen construct is a gpl20 
polypeptide and the second polypeptide is a gp41 polypeptide, optionally with a portion of the 
hydrophobic region of the gp41 polypeptide being deleted so as to facilitate expression when 
expressed as a recombinant product Also preferred among the above antigen constructs are 

15 those in which at least one of the first and second HIV-1 Group O env polypeptides is derived 
from HIV-1 Group O isolate HAM1 12, as are those in which the first polypeptide comprises 
an immunoreactive portion of the gpl20 protein of HIV-1 Group O isolate HAM112. 

In the above Group O env constructs, the first polypeptide may have an amino acid 
sequence which consists essentially of residues 1 through 520 of the sequence shown in 

20 Figure 1 (SEQ ID NO:61), or alternatively an immunoreactive portion thereof. A shortened 
and preferred first polypeptide is one having an amino acid sequence consisting essentially of 
residues 476 through 520 of the sequence of Figure 1 (SEQ ID NO:61). Along with any of the 
above polypeptides, the second polypeptide used in the constructs of the invention may be an 
immunoreactive portion of the gp41 protein of HIV-1 Group O isolate HAM 11 2, from which a 

25 portion of the hydrophobic region of the gp41 protein of HIV-1 Group O isolate HAM1 12 is 
optionally absent In particular, the deleted portion may be that part of gp41 which has an 
amino acid sequence consisting essentially of residues 690 through 715 of the sequence of 
Figure 1 (SEQ ID NO:61). 

The above second polypeptide will preferably have an amino acid sequence consisting 

30 essentially of residues 521 through 873 of the sequence of Figure 1 (SEQ ID NO:61) or a 
portion thereof. More preferably, the second polypeptide may have an amino acid sequence 
consisting essentially of residues 47 through 373 of Figure 9 (SEQ ID NO:52); still more 
preferably, the amino acid sequence may consist essentially of residues 47 through 245 of 
Figure 7 (SEQ ID NO:48); and even more preferably, the amino acid sequence may consist 

35 essentially of residues 47 through 215 of Figure 5 (SEQ ID NO:58). Representative of the 
Group O env constructs of the invention are constructs pGO-8PL, pGO-8CKS, pGO-9PL, 
pGO-9CKS, pGO-HPL and pGO-llCKS, as well as any derivatives, variants and analogs 
thereof. 
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In a further aspect of the present invention, there is disclosed an antigen construct 
comprising a fusion of at least one HIV-1 Group O env polypeptide with at least one HIV-1 
Group M env polypeptide, and more preferably an antigen construct comprising a fusion of: 
(a) a first HIV-1 Group O env polypeptide; 
5 (b) a second HIV-1 Group O env polypeptide; 

(c) a first HIV-1 Group M env polypeptide; and 

(d) a second HIV-1 Group M env polypeptide. 

The HIV-1 Group M polypeptides in the above constructs may be derived from an HTV-1 
isolate of Subtype B, and preferably at least one is derived from HIV-1 Group M isolate 
10 HXB2R. In any of these Group O/Group M env constructs, at least one of the HIV- 1 Group 
O sequences may be derived from HTV-1 Group O isolate HAM112. 

More particularly, the first Group O env polypeptide and the first Group M env 
polypeptide may both be gpl20 polypeptides, while the second Group O env polypeptide and 
the second Group M env polypeptide may both be gp41 polypeptides. To enhance expression, 
15 a portion of the hydrophobic region of at least one of the gp41 polypeptides may be deleted. 
Antigen constructs included among the above are those in which: 

(a) the first HIV-1 Group O env polypeptide comprises an immunoreactive portion of 
the gpl20 protein of HTV-1 Group O isolate HAM112; 

(b) the second HTV-1 Group O env polypeptide comprises an immunoreactive portion 
20 of the gp41 protein of HIV- 1 Group O isolate HAM1 12 

(c) the first HTV-1 Group M env polypeptide comprises an immunoreactive portion of 
the gpl20 protein of a first HTV-1 Group M isolate of Subtype B; and 

(d) the second HTV-1 Group M env polypeptide comprises an immunoreactive portion 
of the gp41 protein of a second HIV- 1 Group M isolate of Subtype B. 

25 Preferred among these are constructs wherein the first and second HIV- 1 Group M isolates of 
Subtype B are the same and are HIV- 1 Group M isolate HXB2R, as well as those wherein a 
portion of the hydrophobic region of the gp41 protein of HTV- 1 Group M isolate HXB2R is 
absent from the second HTV-1 Group M env polypeptide. 

Preferred Group O/Group M env constructs include those in which (a) the first HTV-1 

30 Group M env polypeptide has an amino acid sequence consisting essentially of residues 25 1 
through 292 of the sequence of Figure 12 (SEQ ID NO:108), and (b) the second HTV-1 Group 
M env polypeptide has an amino acid sequence consisting essentially of residues 293 through 
599 of the sequence of Figure 12 (SEQ TD NO: 108) or a portion thereof. Especially preferred 
are those in which the second HTV-1 Group M env polypeptide has an amino acid sequence 

35 consisting essentially of residues 293 through 492 of the sequence of Figure 12 (SEQ ID 
NO: 108). 

Also preferred are the above Group O/Group M env constructs in which the first HTV-1 
Group O env polypeptide has an amino acid sequence consisting essentially of residues 1 
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through 520 of the sequence of Figure 1 (SEQ ID NO:61) or a portion thereof, and especially 
those comprising a first HIV-1 Group O env polypeptide which has an amino acid sequence 
consisting essentially of residues 476 through 520 of the sequence of Figure 1 (SEQ ID 
NO:61). The second HIV-1 Group O env polypeptide may be one having an amino acid 
5 sequence consisting essentially of residues 521 through 873 of the sequence of Figure 1 (SEQ 
ID NO:61) or a portion thereof, from which a portion of the hydrophobic region of the gp41 
protein of HTV-1 Group O isolate HAM1 12 may optionally be absent Preferred constructs are 
those in which such second fflV-1 Group O env polypeptides have an amino acid sequence 
consisting essentially of residues 47 through 373 of Figure 9 (SEQ ID NO:52); more preferred 
10 are those in which the second HIV- 1 Group O env polypeptide has an amino acid sequence 

consisting essentially of residues 47 through 245 of Figure 7 (SEQ ID NO:48); and even more 
preferred are those in which the second HIV-1 Group O env polypeptide has an amino acid 
sequence consisting essentially of residues 47 through 215 of Figure 5 (SEQ ID NO:58). 
Representative of the Group O/Group M env constructs of the invention are constructs 
15 pGO-12CKS, pGO-13CKS and pGO-14PL, and derivatives, variants and analogs thereof. 

In yet another aspect of the present invention, an antigen construct is disclosed which 
comprises a fusion of a first HIV-1 env polypeptide, a second HTV-1 env polypeptide, and at 
least one additional HIV-1 polypeptide, and especially one in which each such HIV-1 env 
polypeptides are HIV-1 Group O polypeptides. The first HTV-1 Group O env polypeptide of 
20 this construct may be a gpl20 polypeptide, and the second HTV-1 Group O env polypeptide a 
gp41 polypeptide. More particularly, the first HTV-1 Group O env polypeptide of this 
construct may comprise an immunoreactive portion of the gpl20 protein of HTV-1 Group O 
isolate HAM112, while the second HTV-1 Group O env polypeptide may comprise an 
immunoreactive portion of the gp41 protein of HTV-1 Group O isolate HAM1 12. 
25 Among these constructs, those in which the first HTV- 1 Group O env polypeptide has 

an amino acid sequence consisting essentially of residues 1 through 520 of the sequence of 
Figure 1 (SEQ ID NO:61), or a portion thereof, are preferred; more preferred are those in 
which the first HIV-1 Group O env polypeptide has an amino acid sequence consisting 
essentially of residues 476 through 520 of the sequence of Figure 1 (SEQ TD NO:61). As to 
30 the second HTV-1 Group O env polypeptide, which may have an amino acid sequence 
consisting essentially of residues 521 through 873 of the sequence of Figure 1 (SEQ ID 
NO:61) or a portion thereof and from which a portion of the hydrophobic region of the gp41 
protein of HTV-1 Group O isolate HAM112 may optionally be absent, preferred are those 
constructs in which that second HTV-1 Group O env polypeptide has an amino acid sequence 
35 consisting essentially of residues 47 through 373 of Figure 9 (SEQ TD NO:52). Even more 
preferred are those having a second HTV-1 Group O env polypeptide with an amino acid 
sequence consisting essentially of residues 47 through 245 of Figure 7 (SEQ TD NO:48), and 
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especially those in which the amino acid sequence consists essentially of residues 47 through 
215 of Figure 5 (SEQ ID NO:58). 

The additional HIV-1 polypeptide in any of these constructs may be a Group O env 
polypeptide; however, it is intended that it may alternatively be an immunogenic polypeptide 

5 from any of HIV- 1 Groups M or O or HIV-2, including env, gag, pol, reverse transcriptase, 
and regulatory and other viral components. Preferred in any case are those constructs in which 
the additional HIV-1 Group O polypeptide comprises an immunoreactive portion of the gp41 
protein of HIV-1 Group O isolate HAM112. Also preferred are those wherein the additional 
HIV-1 Group O polypeptide has an amino acid sequence consisting essentially of residues 521 

10 through 873 of the sequence of Figure 1 (SEQ ID NO:61) or a portion thereof, from which the 
hydrophobic region of the gp41 protein of HIV-1 Group O isolate HAM1 12 may optionally be 
absent Even more preferred are constructs in which the additional HIV-1 Group O env 
polypeptide has an amino acid sequence consisting essentially of residues 47 through 373 of 
Figure 9 (SEQ ID NO:52); particularly preferred are those in which the additional HIV-1 

15 Group O env polypeptide has an amino acid sequence consisting essentially of residues 47 
through 245 of Figure 7 (SEQ ID NO:48), and especially those wherein the additional HIV-1 
Group O env polypeptide has an amino acid sequence consisting essentially of residues 47 
through 215 of Figure 5 (SEQ ID NO:58). Most preferred are constructs having as the 
additional HIV-1 Group O env polypeptide the so-called immunodominant region (IDR) of 

20 HIV-1 Group O, which has an amino acid sequence consisting essentially of residues 250 
through 281 of Figure 17 (SEQ ID NO: 120). Representative of the above constructs are 
pGO-15CKS and pGO-15PL, as well as any derivatives, variants and analogs thereof. 

In still another aspect of the present invention is disclosed an antigen construct 
comprising a first HIV-2 env polypeptide fused to a second HIV-2 env polypeptide, and 

25 especially one in which the first HIV-2 env polypeptide is a gpl20 polypeptide and the second 
HIV-2 env polypeptide is a gp36 polypeptide. Preferred among the such constructs are those 
in which: 

(a) the first HIV-2 env polypeptide has an amino acid sequence consisting essentially 
of residues 248 through 307 of the sequence of Figure 1 1 (SEQ ID NO:55) or a portion 

30 thereof; and 

(b) the second HIV-2 env polypeptide has an amino acid sequence consisting 
essentially of residues 308 through 466 of the sequence of Figure 1 1 (SEQ ID NO:55) or a 
portion thereof. 

Representative of the HIV-2 constructs of the invention is pHIV-210 (SEQ ID NO:55), as well 
35 as any derivatives, variants and analogs thereof. 

An additional aspect of the present invention comprises polynucleotides encoding any 
of the above antigen construct, which polynucleotide may be operably linked to a control 
sequence capable of directing expression in a suitable host and/or have a coding sequence 
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which has been modified to provide a codon bias appropriate to the expression host. Still other 
aspects of the present invention include expression vectors comprising such polynucleotides 
and host cells transformed thereby, particularly where the host is Escherichia colL 

In a further aspect of the present invention, there is disclosed a method for detecting 
5 antibodies to HIV-1 in a test sample comprising the steps of: 

(a) combining at least one antigen construct according to the invention with the test 
sample to form a mixture; 

(b) incubating the mixture under conditions suitable for formation of complexes 
between the antigen and antibodies, if any, which are present in the sample and are 

10 immunologically reactive with the antigen; and 

(c) detecting the presence of any complexes formed. 

In one embodiment of the method, detection of the presence of complexes in step (c) is carried 
out using an additional antigen construct of the invention to which a signal-generating 
compound has been attached. In another embodiment, detection is carried out using an 

15 additional antigen construct of the invention to which a first member of a specific binding pah- 
is attached, and further using an indicator reagent comprising a second member of the specific 
binding pair to which is attached a signal-generating compound. A further embodiment 
provides that detection of the presence of complexes in step (c) is carried out using an antibody 
directed to the complexes formed in step (b), to which antibody is attached a signal-generating 

20 compound. Still another embodiment provides that detection of the presence of complexes in 
step (c) is carried out using an antibody directed to the complexes formed in step (b) and 
attached thereto a first member of a specific binding pair; such detection further requires the use 
of an indicator reagent comprising a second member of the specific binding pair to which is 
attached a signal-generating compound. 

25 In a final aspect of the present invention are disclosed immunoassay kits for the 

detection of antibodies to HIV-1, which kits comprise an antigen construct of the invention. 
Such construct may be used as a capture reagent or an indicator reagent. Alternatively, the 
antigen construct may be attached to a first member of a specific binding pair, the kit 
additionally comprising an indicator reagent comprising a second member of the specific 

30 binding pair attached to a signal-generating compound. 

Brief Description of the Drawings 

In the detailed description of the present invention which follows, reference is made to 
the attached drawings in which: 
35 FIGURE 1 shows the deduced amino acid sequence of the env protein from the HIV-1 

Group O isolate HAM112 (SEQ ID NO:61). 

FIGURE 2 depicts the strategy used to generate synthetic HIV-1 Group O env 
gpl20/gp41 gene constructs, wherein the pGO-8 insert = Osyn-5' to Osyn-P3'; pGO-9 insert 
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= Osyn-5' to Osyn-03'; pGO-11 insert = Osyn-5' to Osyn-M; and wherein H = the 
hydrophobic region of HIV- 1 Group O, deleted as shown. 

FIGURES 3A through 3D show a diagrammatic representation of the steps involved in 
construction of pGO-9PL/DH5a and pGO-9CKS/XLl. 
5 FIGURES 4A through 4G show a diagrammatic representation of the steps involved in 

construction of pGO-1 lPL/DH5a and pGO-1 1CKS/XL1. 

FIGURE 5 illustrates the amino acid sequence of the pGO-8PL recombinant protein 
(SEQIDNO:58). 

FIGURE 6 shows the amino acid sequence of the pGO-8CKS recombinant protein 

10 (SEQ ID NO:60). 

FIGURE 7 illustrates the amino acid sequence of the pGO-9PL recombinant protein 

(SEQIDNO:48X 

FIGURE 8 shows the amino acid sequence of the pGO-9CKS recombinant protein 
(SEQ ID NO:50). 

1 5 FIGURE 9 illustrates the amino acid sequence of the pGO- 1 1PL recombinant protein 

(SEQK)NO:52). 

FIGURE 10 shows the amino acid sequence of the pGO-1 1CKS recombinant protein 
(SEQIDNO:54). 

FIGURE 11 illustrates the amino acid sequence of the pHIV-210 recombinant protein 
20 (SEQK>NO:55). 

FIGURE 12 illustrates the amino acid sequence of the pGM-lCKS recombinant protein 
(SEQUENCE ID NO: 108). 

FIGURE 13 illustrates the amino acid sequence of the pGO-12CKS recombinant 
protein (SEQ ID NO:91), including an indication of the residues corresponding to the 
25 CKS/polylinker, env gpl20/gp41 from the HIV-1 group M isolate HXB2R, and env 
gpl20/gp41 from the HIV-1 group O isolate HAM112. 

FIGURE 14 illustrates the amino acid sequence of the pGO-13CKS recombinant 
protein (SEQ ID NO:93), including an indication of the residues corresponding to the 
CKS/polylinker, env gpl20/gp41 from the HIV-1 group M isolate HXB2R, and env 
30 gpl20/gp41 from the HIV- 1 group O isolate HAM1 12. 

FIGURE 15 illustrates the amino acid sequence of the pGO-14PL recombinant protein 
(SEQ ID NO:95), including an indication of the residues corresponding to env gpl20/gp41 
from the HIV-1 group M isolate HXB2R and env gpl20/gp41 from the HIV-1 group O isolate 
HAM 112. 

35 FIGURE 16 illustrates the amino acid sequence of the pGO-15CKS recombinant 

protein (SEQ ID NO:97), including an indication of the residues corresponding to the 
CKS/polylinker, env gpl20/gp41 from the HIV-1 group O isolate HAM112, a four-amino acid 
linker, and the second copy of the gp41 IDR from the HAM1 12 isolate. 
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FIGURE 17 illustrates the amino acid sequence of the pGO-15PL recombinant protein 
(SEQ ID NO: 120), including an indication of the residues corresponding to env gpl20/gp41 
from the HIV-1 group O isolate HAM1 12, a four- amino acid linker, and the second copy of the 
gp41 IDR from the HAM1 12 isolate. 

FIGURES 18-23 show the results obtained in coated-bead immunoassays (described in 
Example 14, below) testing the reactivity of the Group M antigen pTB319 and Group O 
recombinant antigens pGO-9CKS, pGO-1 1PL, pGO-12CKS, pGO-14PL and pGO-15CKS, 
respectively, with a panel of sera comprising HIVPL-31 (Group M-positive) and sera numbers 
14283, 189404, 193Ha, 14791, 267Ha andESP-1 (all Group O-positive). 



Detailed Description of the Invention 

In one embodiment of an isolated polypeptide of the present invention, the amino acid 
sequence of the env protein of the HIV-1 Group O isolate HAM1 12 is shown in Figure 1 (SEQ 
ID NO:61). In the present context, "isolated" is intended to mean that such polypeptides are 

15 relatively purified with respect to other viral or cellular components which normally would be 
present in situ, up to and including a substantially pure preparation of the protein. Such 
polypeptides can be utilized as assay reagents, for the production of monoclonal or polyclonal 
antibodies, in the manufacture of vaccines, or otherwise. 

Immunoreactive portions, or fragments, of the above polypeptides are also expected to 

20 be useful. By "immunoreactive" is meant portions of such length as are capable of eliciting an 
immune response in a host and/or of reacting with antibodies directed specifically thereto; 
preferably, such partial polypeptides will be five or more amino acids in length. It should also 
be noted that the term "portion" as used herein is directed to both terminally truncated 
sequences and those which are shortened by the removal of an intervening sequence. 

25 The above polypeptides and portions will best be produced by expression of 

polynucleotides encoding the same. These too permit a degree of variability in their sequence, 
as for example due to degeneracy of the genetic code, codon bias in favor of the host cell 
expressing the polypeptide, and conservative amino acid substitutions in the resulting protein. 
Moreover, it is anticipated that some variation of sequences will occur between ~ and possibly 

30 even within — a given HIV-1 isolate or other phylogenetic unit. Consequently, the 

polypeptides and constructs of the invention include not only those which are identical in 
sequence to the above sequence but also those which have an amino acid sequence that consist 
essentially of that reference sequence, where the term "consisting essentially" is meant to 
embrace variant polypeptides the structural and functional characteristics of which remain 

35 substantially the same. Preferably, such variants (or "analogs") will have a sequence 

homology ("identity") of 80% or more with the reference sequence of Figure 1. In this sense, 
techniques for determining amino acid sequence "similarity" are well-known in the art. In 
general, "similarity" means the exact amino acid to amino acid comparison of two or more 
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polypeptides at the appropriate place, where amino acids are identical or possess similar 
chemical and/or physical properties such as charge or hydrophobicity. A so-termed "percent 
similarity" then can be determined between the compared polypeptide sequences. Techniques 
for determining nucleic acid and amino acid sequence identity also are well known in the art 
and include determining the nucleotide sequence of the mRNA for that gene (usually via a 
cDNA intermediate) and determining the amino acid sequence encoded therein, and comparing 
this to a second amino acid sequence. In general, "identity" refers to an exact nucleotide to 
nucleotide or amino acid to amino acid correspondence of two polynucleotides or polypeptide 
sequences, respectively. Two or more polynucleotide sequences can be compared by 
determining their "percent identity", as can two or more amino acid sequences. The programs 
available in the Wisconsin Sequence Analysis Package, Version 8 (available from Genetics 
Computer Group, Madison, WI), for example, the GAP program, are capable of calculating 
both the identity between two polynucleotides and the identity and similarity between two 
polypeptide sequences, respectively. Other programs for calculating identity or similarity 
between sequences are known in the art 

According to another embodiment of the invention, antigen constructs are provided 
which are suitable for use in the detection of anti-HIV-1 antibodies. As described in greater 
detail below, such constructs may be prepared by recombinant means, as synthetic peptides, or 
otherwise; moreover, they may be glycosylated or unglycosylated depending on the manner 
and/or host cell by which they are made. Consequently, although referred to as if comprising 
glycoproteins (for example, "a gpl20 polypeptide"), the antigen constructs of the invention are 
intended to include those which are expressed in bacterial hosts such as E. coli and are 

therefore unglycosylated. 

It should be noted that the above constructs are fusions of various sequences, that is, 
the constructs are formed by joining various epitope-containing sequences, as for example by 
co-expression, ligation or sequential synthesis. Also joined thereto, and optionally included in 
the constructs of the invention, are other polypeptide sequences such as expression (CKS) 
polylinkers and other linker sequences. The order of the various polypeptide sequences is not 
critical; consequently, the polypeptides and their epitopes may be re-arranged as a matter of 
convenience. Further modifications are also possible, as for example by random mutation or 
site-directed mutagenesis or even the deletion (removal or omission) of certain regions such as 
the gp41 hydrophobic region, the absence of which has been found to enhance expression of 
the remaining polypeptide. In any case, whether nearly the same or substantially changed, 
polypeptides which undergo these modifications may be said to be "derived" from then- 
respective sources, and the resulting polypeptides may be regarded as "derivatives". 

In yet another aspect of the present invention, assay methods are provided which utilize 
the constructs of the invention in the detection of anti-HIV-1 antibodies in test samples. Such 
methods permit the direct testing of biological specimens; however, the assay methods may 
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also be modified to permit the testing of pre-processed specimens such as sera, lysed ceUs, and 
extracts or preparations made therefrom (as by concentration, dilution, separation, fixation 
and/or immobilization). Depending on the desired assay format, the antigen constructs may 
also be modified for use in such assays, as for example by labeling, immobilization on a solid 
phase or otherwise, or conjugation to other assay reagents. 

Certain terms used herein are intended to have specialized meanings. Unless otherwise 
stated, the terms below shall have the following meanings: 

The term "primer" denotes a specific oligonucleotide sequence complementary to a 
target nucleotide sequence and used to hybridize to the target nucleotide sequence. It serves as 
an initiation point for nucleotide polymerization catalyzed by either DNA polymerase, RNA 
polymerase or reverse transcriptase. 

The term "polynucleotide" as used herein means a polymeric form of nucleotides of any 
length, either ribonucleotides or deoxyribonucleotides. This term refers only to the primary 
structure of the molecule. Thus, the term includes double- and single-stranded DNA as well as 
double- and single-stranded RNA. It also includes modifications, such as methylation or 
capping, and unmodified forms of the polynucleotide. 

"Encoded by" refers to a nucleic acid sequence which codes for a polypeptide 
sequence. Also encompassed are polypeptide sequences which are immunologically 
identifiable with a polypeptide encoded by the sequence. Thus, a "polypeptide," "protein," or 
"amino acid" sequence as claimed herein may have at least 60% similarity, more preferably at 
least about 70% similarity, and most preferably about 80% similarity to a particular polypeptide 
or amino acid sequence specified below. 

The terms "recombinant polypeptide" or "recombinant protein", used interchangeably 
herein, describe a polypeptide which by virtue of its origin or manipulation is not associated 
with all or a portion of the polypeptide with which it is associated in nature and/or is linked to a 
polypeptide other than that to which it is linked in nature. A recombinant or encoded 
polypeptide or protein is not necessarily translated from a designated nucleic acid sequence. It 
also may be generated in any manner, including chemical synthesis or expression of a 
recombinant expression system. 

"Polypeptide" and "protein" are used interchangeably herein and indicate a molecular 
chain of amino acids linked through covalent and/or noncovalent bonds. The terms do not 
refer to a specific length of the product. Thus, peptides, oligopeptides and proteins are 
included within the definition of polypeptide. The terms include post-expression modifications 
of the polypeptide, for example, glycosylations, acetylations, phosphorylations and the like. 
In addition, protein fragments, analogs, mutated or variant proteins, fusion proteins and the 
like are included within the meaning of polypeptide. 
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A "fragment" of a specified polypeptide refers to an amino acid sequence which 
comprises at least about 3-5 amino acids, more preferably at least about 8-10 amino acids, and 
even more preferably at least about 15-20 amino acids, derived from the specified polypeptide. 

The term "synthetic peptide" as used herein means a polymeric form of amino acids of 
any length, which may be chemically synthesized by methods well-known to those skilled in 
the art. These synthetic peptides are useful in various applications. 

"Purified polypeptide" means a polypeptide of interest or fragment thereof which is 
essentially free, that is, contains less than about 50%, preferably less than about 70%, and 
more preferably, less than about 90% of cellular components with which the polypeptide of 
interest is naturally associated. Methods for purifying are known in the art. 

The term "isolated" means that the material is removed from its original environment 
(e.g., the natural environment if it is naturally occurring). For example, a naturally-occurring 
polynucleotide or polypeptide present in a living animal is not isolated, but the same 
polynucleotide or DNA or polypeptide, which is separated from some or all of the coexisting 
materials in the natural system, is isolated. Such polynucleotide could be part of a vector 
and/or such polynucleotide or polypeptide could be part of a composition, and still be isolated 
in that the vector or composition is not part of its natural environment. 

"Recombinant host cells," "host cells," "cells," "cell lines," "cell cultures," and other 
such terms denoting microorganisms or higher eukaryotic cell lines cultured as unicellular 
entities refer to cells which can be, or have been, used as recipients for recombinant vector or 
other transferred DNA, and include the original progeny of the original cell which has been 
transfected. 

As used herein "replicon" means any genetic element, such as a plasmid, a 
chromosome or a virus, that behaves as an autonomous unit of polynucleotide replication 
within a cell. 

A "vector" is a replicon to which another polynucleotide segment is attached, such as to 
bring about the replication and/or expression of the attached segment. 

The term "control sequence" refers to polynucleotide sequences which are necessary to 
effect the expression of coding sequences to which they are ligated. The nature of such control 
sequences differs depending upon the host organism. In prokaryotes, such control sequences 
generally include promoter, ribosomal binding site and terminators; in eukaryotes, such control 
sequences generally include promoters, terminators and, in some instances, enhancers. The 
term "control sequence" thus is intended to include at a minimum all components whose 
presence is necessary for expression, and also may include additional components whose 
presence is advantageous, for example, leader sequences. 

"Operably linked" refers to a situation wherein the components described are in a 
relationship permitting them to function in their intended manner. Thus, for example, a control 
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sequence "operably linked" to a coding sequence is ligated in such a manner that expression of 
the coding sequence is achieved under conditions compatible with the control sequences. 

A "coding sequence" is a polynucleotide sequence which is transcribed into mRNA and 
translated into a polypeptide when placed under the control of appropriate regulatory 
sequences. The boundaries of the coding sequence are determined by and include a translation 
start codon at the 5' -terminus and one or more translation stop codons at the 3' -terminus. A 
coding sequence can include, but is not limited to, mRNA, cDNA, and recombinant 
polynucleotide sequences. 

The term "immunologically identifiable with/as" refers to the presence of epitope(s) and 
polypeptide(s) which also are present in and are unique to the designated polypeptide(s). 
Immunological identity may be determined by antibody binding and/or competition in binding. 
These techniques are known to the skilled artisan and also are described herein. The 
uniqueness of an epitope also can be determined by computer searches of known data banks, 
such as GenBank, for the polynucleotide sequences which encode the epitope, and by amino 
acid sequence comparisons with other known proteins. 

As used herein, "epitope" means an antigenic determinant of a polypeptide. 
Conceivably, an epitope can comprise three amino acids in a spatial conformation which is 
unique to the epitope. Generally, an epitope consists of at least five such amino acids, and 
more usually, it consists of at least eight to ten amino acids. Methods of examining spatial 
conformation are known in the art and include, for example, x-ray crystallography and two- 
dimensional nuclear magnetic resonance. 

A "conformational epitope" is an epitope that is comprised of specific juxtaposition of 
amino acids in an immunologically recognizable structure, such amino acids being present on 
the same polypeptide in a contiguous or non-contiguous order or present on different 
polypeptides. 

A polypeptide is "immunologically reactive" with an antibody when it binds to an 
antibody due to antibody recognition of a specific epitope contained within the polypeptide. 
Immunological reactivity may be determined by antibody binding, more particularly by the 
kinetics of antibody binding, and/or by competition in binding using as competitor(s) a known 
polypeptide(s) containing an epitope against which the antibody is directed. The methods for 
determining whether a polypeptide is immunologically reactive with an antibody are known in 
the art. 

The term "transformation" refers to the insertion of an exogenous polynucleotide into a 
host cell, irrespective of the method used for the insertion. For example, direct uptake, 
transduction or f-mating are included. The exogenous polynucleotide may be maintained as a 
non-integrated vector, for example, a plasmid, or alternatively, may be integrated into the host 
genome. 
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The term "test sample" refers to a component of an individual's body which is the 
source of the analyte (such as, antibodies of interest or antigens of interest). These components 
are well known in the art These test samples include biological samples which can be tested 
by the methods of the present invention described herein and include human and animal body 
fluids such as whole blood, serum, plasma, cerebrospinal fluid, urine, lymph fluids, and 
various external secretions of the respiratory, intestinal and genitorurinary tracts, tears, saliva, 
milk, white blood cells, myelomas and the like; biological fluids such as cell culture 
supernatants; fixed tissue specimens; and fixed cell specimens. 

"Purified product" refers to a preparation of the product which has been isolated from 
the cellular constituents with which the product is normally associated, and from other types of 
cells which may be present in the sample of interest. 

The present invention provides assays which utilize specific binding members. A 
"specific binding member," as used herein, is a member of a specific binding pair. That is, 
two different molecules where one of the molecules through chemical or physical means 
specifically binds to the second molecule. Therefore, in addition to antigen and antibody 
specific binding pairs of common immunoassays, other specific binding pairs can include 
biotin and avidin, carbohydrates and lectins, complementary nucleotide sequences, effector and 
receptor molecules, cofactors and enzymes, enzyme inhibitors and enzymes, and the like. 
Furthermore, specific binding pairs can include members that are analogs of the original 
specific binding members, for example, an analyte-analog. Immunoreactive specific binding 
members include antigens, antigen fragments, antibodies and antibody fragments, both 
monoclonal and polyclonal, and complexes thereof, including those formed by recombinant 
DNA molecules. 

A "capture reagent," as used herein, refers to an unlabeled specific binding member 
which is specific either for the analyte as in a sandwich assay, for the indicator reagent or 
analyte as in a competitive assay, or for an ancillary specific binding member, which itself is 
specific for the analyte, as in an indirect assay. The capture reagent can be directly or indirectly 
bound to a solid phase material before the performance of the assay or during the performance 
of the assay, thereby enabling the separation of immobilized complexes from the test sample. 

The "indicator reagent" comprises a "signal-generating compound" ("label") which is 
capable of generating and generates a measurable signal detectable by external means, 
conjugated ("attached") to a specific binding member. "Specific binding member" as used 
herein means a member of a specific binding pair. That is, two different molecules where one 
of the molecules through chemical or physical means specifically binds to the second molecule. 
In addition to being an antibody member of a specific binding pair, the indicator reagent also 
can be a member of any specific binding pair, including either hapten-anti-hapten systems such 
as biotin or anti-biotin, avidin or biotin, a carbohydrate or a lectin, a complementary nucleotide 
sequence, an effector or a receptor molecule, an enzyme cofactor and an enzyme, an enzyme 
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inhibitor or an enzyme, and the like. An immunoreactive specific binding member can be an 
antibody, an antigen, or an antibody/antigen complex that is capable of binding either to 
polypeptide of interest as in a sandwich assay, to the capture reagent as in a competitive assay, 
or to the ancillary specific binding member as in an indirect assay. 

The various "signal-generating compounds" (labels) contemplated include chromogens, 
catalysts such as enzymes, luminescent compounds such as fluorescein and rhodamine, 
chemiluminescent compounds such as dioxetanes, acridiniums, phenanthridiniums and 
luminol, radioactive elements, and direct visual labels. Examples of enzymes include alkaline 
phosphatase, horseradish peroxidase, beta-galactosidase, and the like. The selection of a 
particular label is not critical, but it will be capable of producing a signal either by itself or in 
conjunction with one or more additional substances. 

"Solid phases" ("solid supports") are known to those in the art and include the walls of 
wells of a reaction tray, test tubes, polystyrene beads, magnetic beads, nitrocellulose strips, 
membranes, microparticles such as latex particles, sheep (or other animal) red blood cells, and 
Duracytes® (red blood cells "fixed" by pyruvic aldehyde and formaldehyde, available from 
Abbott Laboratories, Abbott Park, IL) and others. The "solid phase" is not critical and can be 
selected by one skilled in the art. Thus, latex particles, microparticles, magnetic or non- 
magnetic beads, membranes, plastic tubes, walls of microliter wells, glass or silicon chips, 
sheep (or other suitable animal's) red blood cells and Duracytes® are all suitable examples. 
Suitable methods for immobilizing peptides on solid phases include ionic, hydrophobic, 
covalent interactions and the like. A "solid phase", as used herein, refers to any material which 
is insoluble, or can be made insoluble by a subsequent reaction. The solid phase can be chosen 
for its intrinsic ability to attract and immobilize the capture reagent. Alternatively, the solid 
phase can retain an additional receptor which has the ability to attract and immobilize the 
capture reagent. The additional receptor can include a charged substance that is oppositely 
charged with respect to the capture reagent itself or to a charged substance conjugated to the 
capture reagent. As yet another alternative, the receptor molecule can be any specific binding 
member which is immobilized upon (attached to) the solid phase and which has the ability to 
immobilize the capture reagent through a specific binding reaction. The receptor molecule 
enables the indirect binding of the capture reagent to a solid phase material before the 
performance of the assay or during the performance of the assay. The solid phase thus can be 
a plastic, derivatized plastic, magnetic or non-magnetic metal, glass or silicon surface of a test 
tube, microliter well, sheet, bead, microparticle, chip, sheep (or other suitable animal's) red 
blood cells, Duracytes® and other configurations known to those of ordinary skill in the art 

It is contemplated and within the scope of the present invention that the solid phase also 
can comprise any suitable porous material with sufficient porosity to allow access by detection 
antibodies and a suitable surface affinity to bind antigens. Microporous structure generally are 
preferred, but materials with gel structure in the hydrated state may be used as well. Such 
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useful solid supports include but are not limited to nitrocellulose and nylon. It is contemplated 
that such porous solid supports described herein preferably are in the form of sheets of 
thickness from about 0.01 to 0.5 mm, preferably about 0.1 mm. The pore size may vary 
within wide limits, and preferably is from about 0.025 to 15 microns, especially from about 
0.15 to 15 microns. The surface of such supports may be activated by chemical processes 
which cause covalent linkage of the antigen or antibody to the support. The irreversible 
binding of the antigen or antibody is obtained, however, in general, by adsorption on the 
porous material by poorly understood hydrophobic forces. Other suitable solid supports are 
known in the art. 

The present invention provides polynucleotide sequences derived from human 
immunodeficiency viruses of interest and polypeptides encoded thereby. The polynucleotide(s) 
may be in the form of mRNA or DNA. Polynucleotides in the form of DNA, cDNA, genomic 
DNA, and synthetic DNA are within the scope of the present invention. The DNA may be 
double-stranded or single-stranded, and if single stranded may be the coding (sense) strand or 
non-coding (anti-sense) strand. The coding sequence which encodes the polypeptide may be 
identical to the coding sequence provided herein or may be a different coding sequence which 
coding sequence, as a result of the redundancy or degeneracy of the genetic code, encodes the 
same polypeptide as the DNA provided herein. 

This polynucleotide may include only the coding sequence for the polypeptide, or the 
coding sequence for the polypeptide and additional coding sequence such as a leader or 
secretory sequence or a proprotein sequence, or the coding sequence for the polypeptide (and 
optionally additional coding sequence) and non-coding sequence, such as a non-coding 
sequence 5' and/or 3' of the coding sequence for the polypeptide. 

In addition, the invention includes variant polynucleotides containing modifications 
such as polynucleotide deletions, substitutions or additions; and any polypeptide modification 
resulting from the variant polynucleotide sequence. A polynucleotide of the present invention 
also may have a coding sequence which is a naturally-occurring variant of the coding sequence 
provided herein. 

In addition, the coding sequence for the polypeptide may be fused in the same reading 
frame to a polynucleotide sequence which aids in expression and secretion of a polypeptide 
from a host cell, for example, a leader sequence which functions as a secretory sequence for 
controlling transport of a polypeptide from the cell. The polypeptide having a leader sequence 
is a preprotein and may have the leader sequence cleaved by the host cell to form the form of 
the polypeptide. The polynucleotides may also encode for a proprotein which is the protein 
plus additional 5' amino acid residues. A protein having a prosequence is a proprotein and 
may in some cases be an inactive form of the protein. Once the prosequence is cleaved an 
active protein remains. Thus, the polynucleotide of the present invention may encode for a 
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protein, or for a protein having a prosequence or for a protein having both a presequence 
(leader sequence) and a prosequence. 

The polynucleotides of the present invention may also have the coding sequence fused 
in frame to a marker sequence which allows for purification of the polypeptide of the present 
invention. The marker sequence may be a hexa-histidine tag supplied by a pQE-9 vector to 
provide for purification of the polypeptide fused to the marker in the case of a bacterial host, 
or, for example, the marker sequence may be a hemagglutinin (HA) tag when a mammalian 
host, e.g. COS-7 cells, is used. The HA tag corresponds to an epitope derived from the 
influenza hemagglutinin protein. See, for example, I. Wilson et al., Cell 37:767 (1984). 

The present invention further relates to HTV-1 polypeptides which have the deduced 
amino acid sequence as provided herein, as well as fragments, analogs and derivatives of such 
polypeptides. The polypeptides of the present invention may be recombinant polypeptides, 
natural purified polypeptides or synthetic polypeptides. The fragment, derivative or analog of 
such a polypeptide may be one in which one or more of the amino acid residues is substituted 
with a conserved or non-conserved amino acid residue (preferably a conserved amino acid 
residue) and such substituted amino acid residue may or may not be one encoded by the genetic 
code; or it may be one in which one or more of the amino acid residues includes a substituent 
group; or it may be one in which the polypeptide is fused with another compound, such as a 
compound to increase the half-life of the polypeptide (for example, polyethylene glycol); or it 
may be one in which the additional amino acids are fused to the polypeptide, such as a leader or 
secretory sequence or a sequence which is employed for purification of the polypeptide or a 
proprotein sequence. Such fragments, derivatives and analogs are within the scope of the 
present invention. The polypeptides and polynucleotides of the present invention are 
preferably provided in an isolated form, and preferably purified. 

Thus, a polypeptide of the present invention may have an amino acid sequence that is 
identical to that of the naturaUy-occurring polypeptide or that is different by minor variations 
due to one or more amino acid substitutions. The variation may be a "conservative change" 
typically in the range of about 1 to 5 amino acids, wherein the substituted amino acid has 
similar structural or chemical properties, e.g., replacement of leucine with isoleucine or 
threonine with serine. In contrast, variations may include nonconservative changes, e.g., 
replacement of a glycine with a tryptophan. Similar minor variations may also include amino 
acid deletions or insertions, or both. Guidance in determining which and how many amino 
acid residues may be substituted, inserted or deleted without changing biological or 
immunological activity may be found using computer programs well known in the art, for 
example, DNASTAR software (DNASTAR Inc., Madison WI). 

The recombinant polypeptides of the present invention can be produced not only as 
demonstrated below, but also according to a number of alternative methods and using a variety 
of host cells and expression vectors. Host cells are genetically engineered (transduced or 
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transformed or transfected) with the vectors of this invention which may be a cloning vector or 
an expression vector. The vector may be in the form of a plasmid, a viral particle, a phage, etc. 
The engineered host cells can be cultured in conventional nutrient media modified as 
appropriate for activating promoters, selecting transformants or amplifying HIV-derived genes. 
The culture conditions, such as temperature, pH and the like, are those previously used with 
the host ceU selected for expression, and will be apparent to the ordinarily skilled artisan. 

The polynucleotides of the present invention may be employed for producing a 
polypeptide by recombinant techniques. Thus, the polynucleotide sequence may be included in 
any one of a variety of expression vehicles, in particular vectors or plasmids for expressing a 
polypeptide. Such vectors include chromosomal, nonchromosomal and synthetic DNA 
sequences, e.g., derivatives of SV40; bacterial plasmids; phage DNA; yeast plasmids; vectors 
derived from combinations of plasmids and phage DNA, viral DNA such as vaccinia, 
adenovirus, fowl pox virus, and pseudorabies. However, any other plasmid or vector may be 
used so long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures. Li general, the DNA sequence is inserted into appropriate restriction endonuclease 
sites by procedures known in the art. Such procedures and others are deemed to be within the 
scope of those skilled in the art. The DNA sequence in the expression vector is operatively 
linked to an appropriate expression control sequence(s) (promoter) to direct mRNA synthesis. 
Representative examples of such promoters include but are not limited to LTR or S V40 
promoter, the E. coli lac or trp, the phage lambda P sub L promoter and other promoters 
known to control expression of genes in prokaryotic or eukaryotic cells or their viruses. The 
expression vector also contains a ribosome binding site for translation initiation and a 
transcription terminator. The vector may also include appropriate sequences for amplifying 
expression. In addition, the expression vectors preferably contain a gene to provide a 
phenotypic trait for selection of transformed host cells such as dihydrofolate reductase or 
neomycin resistance for eukaryotic cell culture, or such as tetracycline or ampicillin resistance 
in E. coli. 

The vector containing the appropriate DNA sequence as hereinabove described, as well 
as an appropriate promoter or control sequence, may be employed to transform an appropriate 
host to permit the host to express the protein. As representative examples of appropriate hosts, 
there may be mentioned: bacterial cells, such as E. coli, Salmonella typhimurium; Streptomyces 
sp:, fungal cells, such as yeast; insect cells such as Drosophila and Sf9; animal cells such as 
Chinese hamster ovary (CHO), COS or Bowes melanoma; plant cells, etc. The selection of an 
appropriate host is deemed to be within the scope of those skilled in the art from the teachings 
provided herein. 

More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs comprise 
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a vector, such as a plasmid or viral vector, into which a sequence of the invention has been 
inserted, in a forward or reverse orientation. In a preferred aspect of this embodiment, the 
construct further comprises regulatory sequences, including, for example, a promoter, 
operably linked to the sequence. Large numbers of suitable vectors and promoters are known 
to those of skill in the art, and are commercially available. The f ollowing vectors are provided 
by way of example. Bacterial: pINCY (Incyte Pharmaceuticals Inc., Palo Alto, CA), 
pSPORTl (Life Technologies, Gaithersburg, MD), pQE70, pQE60, pQE-9 (Qiagen) pBs, 
phagescript, psiX174, pBluescript SK, pBsKS, pNH8a, pNH16a, pNH18a, pNH46a 
(Stratagene); P Trc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic: 
pWLneo, pSV2cat, pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia). However, any other plasmid or vector may be used as long as it is replicable and 
viable in the host. 

Promoter regions can be selected from any desired gene using CAT (chloramphenicol 
transferase) vectors or other vectors with selectable markers. Two appropriate vectors are 
pKK232-8 and pCM7. Particular named bacterial promoters include lacl, lacZ, T3, SP6, T7, 
gpt, lambda P sub R, P sub L and trp. Eukaryotic promoters include cytomegalovirus (CMV) 
immediate early, herpes simplex virus (HSV) thymidine kinase, early and late SV40, LTRs 
from retrovirus, and mouse metaUothionein-I. Selection of the appropriate vector and promoter 
is well within the level of ordinary skill in the art. 

The host cell used herein can be a higher eukaryotic cell, such as a mammalian cell, or a 
lower eukaryotic cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a 
bacterial ceU. Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-Dextran mediated transfection, or electroporation (L. Davis et 
al., "Basic Methods in Molecular Biology", 2nd edition, Appleton and Lang, Paramount 
Publishing, East Norwalk, CT [1994]). 

The constructs in host cells can be used in a conventional manner to produce the gene 
product encoded by the recombinant sequence. Alternatively, the polypeptides of the invention 
can be synthetically produced by conventional peptide synthesizers. 

Proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under the 
control of appropriate promoters. Cell-free translation systems also can be employed to 
produce such proteins using RNAs derived from the DNA constructs of the present invention. 
Appropriate cloning and expression vectors for use with prokaryotic and eukaryotic hosts are 
described by Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Edition, 
(Cold Spring Harbor, N.Y., 1989), which is hereby incorporated by reference. 

Transcription of a DNA encoding the polypeptides of the present invention by higher 
eukaryotes is increased by inserting an enhancer sequence into the vector. Enhancers are cis- 
acting elements of DNA, usually about from 10 to 300 bp, that act on a promoter to increase its 
transcription. Examples include the S V40 enhancer on the late side of the replication origin (bp 
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100 to 270), a cytomegalovirus early promoter enhancer, a polyoma enhancer on the late side 
of the replication origin, and adenovirus enhancers. 

Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene of E. coli and the S. cerevisiae TRP1 gene, and a promoter derived from a highly- 
expressed gene to direct transcription of a downstream structural sequence. Such promoters 
can be derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase 
(PGK), alpha factor, acid phosphatase, or heat shock proteins, among others. The 
heterologous structural sequence is assembled in appropriate phase with translation initiation 
and termination sequences, and preferably, a leader sequence capable of directing secretion of 
translated protein into the periplasmic space or extracellular medium. Optionally, the 
heterologous sequence can encode a fusion protein including an N-terminal identification 
peptide imparting desired characteristics, e.g., stabilization or simplified purification of 
expressed recombinant product. 

Useful expression vectors for bacterial use are constructed by inserting a structural 
DNA sequence encoding a desired protein together with suitable translation initiation and 
termination signals in operable reading phase with a functional promoter. The vector will 
comprise one or more phenotypic selectable markers and an origin of replication to ensure 
maintenance of the vector and to, if desirable, provide amplification within the host Suitable 
prokaryotic hosts for transformation include E. coli, Bacillus subtilis, Salmonella typhimurium 
and various species within the genera Pseudomonas, Streptomyces, and Staphylococcus, 
although others may also be employed as a routine matter of choice. 

Useful expression vectors for bacterial use comprise a selectable marker and bacterial 
origin of replication derived from plasmids comprising genetic elements of the well-known 
cloning vector pBR322 (ATCC 37017). Other vectors include but are not limited to PKK223- 
3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec, Madison, WI). 
These pBR322 "backbone" sections are combined with an appropriate promoter and the 
structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host strain to an 
appropriate cell density, the selected promoter is derepressed by appropriate means (e.g., 
temperature shift or chemical induction), and cells are cultured for an additional period. Cells 
are typically harvested by centrifugation, disrupted by physical or chemical means, and the 
resulting crude extract retained for further purification. Microbial cells employed in expression 
of proteins can be disrupted by any convenient method, including freeze-thaw cycling, 
sonication, mechanical disruption, or use of cell lysing agents; such methods are well-known 
to the ordinary artisan. 

Various mammalian cell culture systems can also be employed to express recombinant 
protein. Examples of mammalian expression systems include the COS-7 lines of monkey 



-20- 



6165.US.O! 



kidney fibroblasts described by Gluzman, Cell 23:175 (1981), and other cell lines capable of 
expressing a compatible vector, such as the C127, 3T3, CHO, HeLa and BHK cell lines. 
Mammalian expression vectors will comprise an origin of replication, a suitable promoter and 
enhancer, and also any necessary ribosome binding sites, polyadenylation site, splice donor 
and acceptor sites, transcriptional termination sequences, 5' flanking nontranscribed 
sequences, and selectable markers such as the neomycin phosphotransferase gene. DNA 
sequences derived from the S V40 viral genome, for example, S V40 origin, early promoter, 
enhancer, splice, and polyadenylation sites may be used to provide the required nontranscribed 
genetic elements. Representative, useful vectors include pRc/CMV and pcDNA3 (available 
from Invitrogen, San Diego, CA). 

The HlV-derived polypeptides are recovered and purified from recombinant cell 
cultures by known methods including ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, hydroxyapatite chromatography or lectin 
chromatography. It is preferred to have low concentrations (approximately 0.1-5 mM) of 
calcium ion present during purification (Price et al., /. Biol. Chem. 244:917 [1969]). Protein 
refolding steps can be used, as necessary, in completing configuration of the protein. Finally, 
high performance liquid chromatography (HPLC) can be employed for final purification steps. 

The polypeptides of the present invention may be naturally purified products expressed 
from a high expressing cell line, or a product of chemical synthetic procedures, or produced by 
recombinant techniques from a prokaryotic or eukaryotic host (for example, by bacterial, yeast, 
higher plant, insect and mammalian cells in culture). Depending upon the host employed in a 
recombinant production procedure, the polypeptides of the present invention may be 
glycosylated with mammalian or other eukaryotic carbohydrates or may be non-glycosylated. 
The polypeptides of the invention may also include an initial methionine amino acid residue. 

The present invention further includes modified versions of the recombinant 
polypeptide to preclude glycosylation while allowing expression of a reduced carbohydrate 
form of the protein in yeast, insect or mammalian expression systems. Known methods for 
inactivating gylcosylation sites include, but are not hrnited to, those presented in U.S. patent 
5,071,972 and EP 276,846, which are incorporated herein by reference. 

Other variants included in the present invention include those obtained by removal 
removal of sequences encoding cystein residues, thereby preventing formation of incorrect 
intramolecular disulfide bridges which decrease biological activity of the protein product. The 
constructs of the present invention also may be prepared by removal of the site of proteolytic 
processing, allowing expression in systems which contain a problematic protease, for example 
the KEX2 protease in yeast. Known methods for removing such protease sites include but are 
not limited to one method for removing KEX2 sites presented in EP212,914. 
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The present invention includes the above peptides in the form of oligomers, dimers, 
trimers and higher order oligomers. Oligomers may be formed by several means including but 
not limited to disulfide bonds between peptides, non-covalent interactions between peptides, 
and poly-ethylene-glycol linkages between peptides. 

The fusion of the above peptides to peptide linkers or peptides that are capable of 
promoting oligomers is also encompassed in this invention. Such peptides include but are not 
limited to leucine zippers and antibody derived peptides, such as is described in Landschulz et 
al., Science 240:1759 (1988); Hollenbaugh and Aruffo, "Construction of Immunoglobin 
Fusion Proteins", in Current Protocols in Immunology, Supplement 4, pgs 10.19.1-10.19.11 
(1992) John Wiley and sons, New York, NY. 

The starting plasmids can be constructed from available plasmids in accord with 
published, known procedures. In addition, equivalent plasmids to those described are known 
in the art and will be apparent to the ordinarily skilled artisan. 

Once homogeneous cultures of recombinant cells are obtained, large quantities of 
recombinantly produced protein can be recovered from the conditioned medium and analyzed 
using chromatographic methods well known in the art. An alternative method for the 
production of large amounts of secreted protein involves the transformation of mammalian 
embryos and the recovery of the recombinant protein from milk produced by transgenic cows, 
goats, sheep, etc. Polypeptides and closely related molecules may be expressed recombinantly 
in such a way as to facilitate protein purification. One approach involves expression of a 
chimeric protein which includes one or more additional polypeptide domains not naturally 
present on human polypeptides. Such purification-facuitating domains include, but are not 
hmited to, metal-chelating peptides such as histidine-tryptophan domains that allow purification 
on immobilized metals, protein A domains that allow purification on immobilized 
immunoglobuhn, and the domain utilized in the FLAGS extension/affinity purification system 
(Immunex Corp, Seattle, WA). The inclusion of a cleavable linker sequence such as Factor 
XA or enterokinase from Invitrogen (San Diego, CA) between the polypeptide sequence and 
the purification domain may be useful for recovering the polypeptide. 

It is also contemplated and within the scope of the present invention that the above 
recombinant antigens will be used in a variety of immunoassay formats, including but not 
limited to direct and indirect assays. The means for adapting the antigens to such various 
formats -- as by conjugation to labels or macromolecules, or immobilization on suitable support 
surfaces -- are well-understood and should be familiar to those skilled in the art. 

For example, the polypeptides including their fragments or derivatives or analogs 
thereof of the present invention, or cells expressing them, can be used for the detection of 
antibodies to fflV (as well as an immunogen to produce antibodies). These antibodies can be, 
for example, polyclonal or monoclonal antibodies, chimeric, single chain and humanized 
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antibodies, as well as Fab fragments, or the product of an Fab expression library. Various 
procedures known in the art may be used for the production of such antibodies and fragments. 

Further, antibodies generated against a polypeptide corresponding to a sequence of the 
present invention can be obtained by direct injection of the polypeptide into an animal or by 
administering the polypeptide to an animal such as a mouse, rabbit, goat or human. A mouse, 
rabbit or goat is preferred. The antibody so obtained then will bind the polypeptide itself. In 
this manner, even a sequence encoding only a fragment of the polypeptide can be used to 
generate antibodies that bind the native polypeptide. Such antibodies can then be used to 
isolate the polypeptide from test samples such as tissue suspected of containing that 
polypeptide. For preparation of monoclonal antibodies, any technique which provides 
antibodies produced by continuous cell line cultures can be used. Examples include the 
hybridoma technique as described by Kohler and Milstein, Nature 256:495-497 (1975), the 
trioma technique, the human B-cell hybridoma technique as described by Kozbor et al, Immun. 
Today 4:72 (1983), and the EBV-hybridoma technique to produce human monoclonal 
antibodies as described by Cole et al., in Monoclonal Antibodies and Cancer Therapy, Alan R. 
Liss, Inc, New York, NY, pp. 77-96 (1985). Techniques described for the production of 
single chain antibodies can be adapted to produce single chain antibodies to immunogenic 
polypeptide products of this invention. See, for example, U.S. Pat. No. 4,946,778, which is 
incorporated herein by reference. 

Various assay formats may utilize such antibodies, including "sandwich" 
immunoassays and probe assays. For example, the monoclonal antibodies or fragment as 
described above can be employed in various assay systems to determine the presence, if any, 
of HIV-derived polypeptide in a test sample. For example, in a first assay format, a polyclonal 
or monoclonal antibody or fragment thereof, or a combination of these antibodies, which has 
been coated on a solid phase, is contacted with a test sample, to form a first mixture. This first 
mixture is incubated for a time and under conditions sufficient to form antigen/antibody 
complexes. Then, an indicator reagent comprising a monoclonal or a polyclonal antibody or a 
fragment thereof, or a combination of these antibodies, to which a signal generating compound 
has been attached, is contacted with the antigen/antibody complexes to form a second mixture. 
This second mixture then is incubated for a time and under conditions sufficient to form 
antibody/antigen/antibody complexes. The presence of an HIV-derived polypeptide antigen 
present in the test sample and captured on the solid phase, if any, is determined by detecting the 
measurable signal generated by the signal generating compound. The amount of HIV-derived 
polypeptide antigen present in the test sample is proportional to the signal generated. 

Or, a polyclonal or monoclonal HIV-derived polypeptide antibody or fragment thereof, 
or a combination of these antibodies which is bound to a solid support, the test sample and an 
indicator reagent comprising a monoclonal or polyclonal antibody or fragments thereof, which 
specifically binds to HIV-derived polypeptide antigen, or a combination of these antibodies to 
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which a signal generating compound is attached, are contacted to form a mixture. This mixture 
is incubated for a time and under conditions sufficient to form antibody/antigen/antibody 
complexes. The presence, if any, of fflV-derived polypeptide present in the test sample and 
captured on the solid phase is determined by detecting the measurable signal generated by the 
signal generating compound. The amount of fflV-derived polypeptide proteins present in the 
test sample is proportional to the signal generated. 

In another assay format, one or a combination of at least two monoclonal antibodies can 
be employed as a competitive probe for the detection of antibodies to fflV-derived polypeptide 
protein. For example, fflV-derived polypeptide proteins such as the recombinant antigens 
disclosed herein, either alone or in combination, are coated on a solid phase. A test sample 
suspected of containing antibody to fflV-derived polypeptide antigen then is incubated with an 
indicator reagent comprising a signal generating compound and at least one monoclonal 
antibody for a time and under conditions sufficient to form antigen/antibody complexes of 
either the test sample and indicator reagent bound to the solid phase or the indicator reagent 
bound to the solid phase. The reduction in binding of the monoclonal antibody to the solid 
phase can be quantitatively measured. 

In yet another detection method, each of the monoclonal or polyclonal antibodies can be 
employed in the detection of fflV-derived polypeptide antigens in fixed tissue sections, as well 
as fixed cells by immunohistochemical analysis. Cytochemical analysis wherein these 
antibodies are labeled directly (with, for example, fluorescein, colloidal gold, horseradish 
peroxidase, alkaline phosphatase, etc.) or are labeled by using secondary labeled anti-species 
antibodies (with various labels as exemplified herein) may be used to track the histopathology 
of disease. 

In addition, these monoclonal antibodies can be bound to matrices similar to CNBr- 
activated Sepharose and used for the affinity purification of specific HIV-derived polypeptide 
proteins from cell cultures or biological tissues such as to purify recombinant and native fflV- 
derived polypeptide antigens and proteins. 

Monoclonal antibodies can also be used for the generation of chimeric antibodies for 
therapeutic use, or other similar applications. 

The monoclonal antibodies or fragments thereof can be provided individually to detect 
HIV-derived polypeptide antigens. Combinations of the monoclonal antibodies (and fragments 
thereof) also may be used together as components in a mixture or "cocktail" of at least one 
HIV-derived polypeptide antibodywith antibodies to other HIV-derived polypeptide regions, 
each having different binding specificities. Thus, this cocktail can include monoclonal 
antibodies which are directed to HIV-derived polypeptide proteins of fflV and other 
monoclonal antibodies to other antigenic determinants of the HIV-derived polypeptide genome. 

The polyclonal antibody or fragment thereof which can be used in the assay formats 
should specifically bind to an HIV-derived polypeptide region or other HIV-derived 
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polypeptide proteins used in the assay. The polyclonal antibody used preferably is of 
mammalian origin; human, goat, rabbit or sheep anti-HTV-derived polypeptide polyclonal 
antibody can be used. Most preferably, the polyclonal antibody is rabbit polyclonal anti-HIV- 
derived polypeptide antibody. The polyclonal antibodies used in the assays can be used either 
alone or as a cocktail of polyclonal antibodies. Since the cocktails used in the assay formats are 
comprised of either monoclonal antibodies or polyclonal antibodies having different HTV- 
derived polypeptide specificity, they would be useful for diagnosis, evaluation and prognosis 
of HIV-derived polypeptide condition, as well as for studying HIV-derived polypeptide protein 
differentiation and specificity. 

It is contemplated and within the scope of the present invention that HIV-derived 
polypeptides may be detectable in assays by use of recombinant antigens as well as by use of 
synthetic peptides or purified peptides, which contain amino acid sequences of HIV-derived 
polypeptides. It also is within the scope of the present invention that different synthetic, 
recombinant or purified peptides identifying different epitopes of each such HIV-derived 
polypeptide can be used in combination in an assay to diagnose, evaluate, or prognosticate the 
HIV disease condition. In this case, these peptides can be coated onto one solid phase, or each 
separate peptide may be coated on separate solid phases, such as microparticles, and then 
combined to form a mixture of peptides which can be later used in assays. Furthermore, it is 
contemplated that multiple peptides which define epitopes from different polypeptides may be 
used in combination to make a diagnosis, evaluation, or prognosis of HTV disease. Peptides 
coated on solid phases or labeled with detectable labels are then allowed to compete with 
peptides from a patient sample for a limited amount of antibody. A reduction in binding of the 
synthetic, recombinant, or purified peptides to the antibody (or antibodies) is an indication of 
the presence of BTV-secreted polypeptides in the patient sample which in turn indicates the 
presence of HTV gene in the patient Such variations of assay formats are known to those of 
ordinary skill in the art and are discussed herein below. 

In another assay format, the presence of antigens and/or antibodies to HIV-derived 
polypeptides can be detected in a simultaneous assay, as follows. A test sample is 
simultaneously contacted with a capture reagent of a first analyte, wherein said capture reagent 
comprises a first binding member specific for a first analyte attached to a solid phase and a 
capture reagent for a second analyte, wherein said capture reagent comprises a first binding 
member for a second analyte attached to a second solid phase, to thereby form a mixture. This 
mixture is incubated for a time and under conditions sufficient to form capture reagent/first 
analyte and capture reagent/second analyte complexes. These so-formed complexes then are 
contacted with an indicator reagent comprising a member of a binding pair specific for the first 
analyte labeled with a signal generating compound and an indicator reagent comprising a 
member of a binding pair specific for the second analyte labeled with a signal generating 
compound to form a second mixture. This second mixture is incubated for a time and under 
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conditions sufficient to form capture reagent/first analyte/indicator reagent complexes and 
capture reagent/second analyte/indicator reagent complexes. The presence of one or more 
analytes is determined by detecting a signal generated in connection with the complexes formed 
on either or both solid phases as an indication of the presence of one or more analytes in the 
test sample. In this assay format, recombinant antigens may be utilized as well as monoclonal 
antibodies produced therefrom. Such assay systems are described in greater detail in EP 
Publication No. 0473065. 

In yet other assay formats, the polypeptides disclosed herein may be utilized to detect 
the presence of antibodies specific for HIV-derived polypeptides in test samples. For example, 
a test sample is incubated with a solid phase to which at least one recombinant protein has been 
attached. These are reacted for a time and under conditions sufficient to form antigen/antibody 
complexes. Following incubation, the antigen/antibody complex is detected. Indicator 
reagents may be used to facilitate detection, depending upon the assay system chosen. In 
another assay format, a test sample is contacted with a solid phase to which a recombinant 
protein produced as described herein is attached and also is contacted with a monoclonal or 
polyclonal antibody specific for the protein, which preferably has been labeled with an 
indicator reagent. After incubation for a time and under conditions sufficient for 
antibody/antigen complexes to form, the solid phase is separated from the free phase, and the 
label is detected in either the solid or free phase as an indication of the presence of HIV-derived 
polypeptide antibody. Other assay formats utilizing the recombinant antigens disclosed herein 
are contemplated. These include contacting a test sample with a solid phase to which at least 
one antigen from a first source has been attached, incubating the solid phase and test sample 
for a time and under conditions sufficient to form antigen/antibody complexes, and then 
contacting the solid phase with a labeled antigen, which antigen is derived from the same 
source or, alternatively, a second source different from the first source. For example, a 
recombinant protein derived from a first source such as E. coli is used as a capture antigen on a 
solid phase, a test sample is added to the so-prepared solid phase, and a recombinant protein 
derived from a different source (i.e., non-E. coli) is utilized as a part of an indicator reagent. 
Likewise, combinations of a recombinant antigen on a solid phase and synthetic peptide in the 
indicator phase also are possible. Any assay format which utilizes an antigen specific for HIV- 
derived polypeptide from a first source as the capture antigen and an antigen specific for HIV- 
derived polypeptide from a second source are contemplated. Thus, various combinations of 
recombinant antigens, as well as the use of synthetic peptides, purified proteins, and the like, 
are within the scope of this invention. Assays such as this and others are described in U.S. 
Patent No. 5,254,458, which enjoys common ownership and is incorporated herein by 
reference. 

Other embodiments which utilize various other solid phases also are contemplated and 
are within the scope of this invention. For example, ion capture procedures for immobilizing 
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an immobilizable reaction complex with a negatively charged polymer (described in EP 
publication 0326100 and EP publication No. 0406473), can be employed according to the 
present invention to effect a fast solution-phase immunochemical reaction. An immobilizable 
immune complex is separated from the rest of the reaction mixture by ionic interactions 
between the negatively charged poly-anion/immune complex and the previously treated, 
positively charged porous matrix and detected by using various signal generating systems 
previously described, including those described in chemiluminescent signal measurements as 
described in EPO Publication No. 0 273,115. 

Also, the methods of the present invention can be adapted for use in systems which 
utilize microparticle technology including in automated and semi-automated systems wherein 
the solid phase comprises a microparticle (magnetic or non-magnetic). Such systems include 
those described in published EPO applications Nos. EP 0 425 633 and EP 0 424 634, 
respectively. 

The use of scanning probe microscopy (SPM) for immunoassays also is a technology 
to which the monoclonal antibodies of the present invention are easily adaptable. In scanning 
probe microscopy, in particular in atomic force microscopy, the capture phase, for example, at 
least one of the monoclonal antibodies of tiie invention, is adhered to a solid phase and a 
scanning probe microscope is utilized to detect antigen/antibody complexes which may be 
present on the surface of the solid phase. The use of scanning tunneling microscopy eUminates 
the need for labels which normally must be utilized in many immunoassay systems to detect 
antigen/antibody complexes. The use of SPM to monitor specific binding reactions can occur 
in many ways. In one embodiment, one member of a specific binding partner (analyte specific 
substance which is the monoclonal antibody of the invention) is attached to a surface suitable 
for scanning. The attachment of the analyte specific substance may be by adsorption to a test 
piece which comprises a solid phase of a plastic or metal surface, following methods known to 
those of ordinary skill in the art. Or, covalent attachment of a specific binding partner (analyte 
specific substance) to a test piece which test piece comprises a solid phase of derivatized 
plastic, metal, silicon, or glass may be utilized. Covalent attachment methods are known to 
those skilled in the art and include a variety of means to irreversibly link specific binding 
partners to the test piece. If the test piece is silicon or glass, the surface must be activated prior 
to attaching the specific binding partner. Also, polyelectrolyte interactions may be used to 
immobilize a specific binding partner on a surface of a test piece by using techniques and 
chemistries. The preferred method of attachment is by covalent means. Following attachment 
of a specific binding member, the surface may be further treated with materials such as serum, 
proteins, or other blocking agents to minimize non-specific binding. The surface also may be 
scanned either at the site of manufacture or point of use to verify its suitability for assay 
purposes. The scanning process is not anticipated to alter the specific binding properties of the 
test piece. 
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While the present invention discloses the preference for the use of solid phases, it is 
contemplated that the reagents such as antibodies, proteins and peptides of the present 
invention can be utilized in non-solid phase assay systems. These assay systems are known to 
those skilled in the art, and are considered to be within the scope of the present invention. 

The present invention will be better understood in connection with the following 
examples, which are meant to illustrate, but not to limit, the spirit and scope of the invention. 



Example 1 
Cloning Procedures 



Oligonucleotides for gene construction and sequencing were synthesized at Abbott 
Laboratories, Synthetic Genetics (San Diego, CA) or Oligo Etc. (Wilsonville, CA). All 
polymerase chain reaction (PCR) reagents, including AmpliTaq DNA polymerase and UlTma 
DNA polymerase, were purchased from Perkin-Elmer Corporation (Foster City, CA) and used 
according to the manufacturer's specifications unless otherwise indicated. PCR amplifications 
were performed on a GeneAmp 9600 thermal cycler (Perkin-Elmer). Unless indicated 
otherwise, restriction enzymes were purchased from New England BioLabs (Beverly, MA) 
and digests were performed as recommended by the manufacturer. DNA fragments used for 
cloning were isolated on agarose (Life Technologies, Gaithersburg, MD) gels, unless 
otherwise indicated. 

Desired fragments were excised and the DNA was extracted with a QIAEX II gel 
extraction kit or the QIAquick gel extraction kit (Qiagen Inc., Chatsworth, CA) as 
recommended by the manufacturer. DNA was resuspended in H 2 0 or TE [1 mM 
ethylenediaminetetraacetic acid (EDTA; pH 8.0; BRL Life Technologies), 10 mM 
1ris(hydroxymemyl)aminomethane-hydrochloride (Tris-HCl; pH 8.0; BRL Life 
Technologies)]. Ligations were performed using a Stratagene DNA ligation kit (Stratagene 
Cloning Systems, La Jolla, CA) as recommended by the manufacturer. Ligations were 
incubated at 16°C overnight 

Bacterial transformations were performed using MAX EFFICIENCY DH5cc competent 
cells (BRL Life Technologies) or Epicurian Coli XLl-Blue supercompetent ceUs (Stratagene 
Cloning Systems) following the manufacturer's protocols. Unless indicated otherwise, 
transformations and bacterial restreaks were plated on LB agar (Lennox) plates with 150 Hg/ml 
ampicillin (M1090; MicroDiagnostics, Lombard, IL) or on LB agar + ampicillin plates 
supplemented with glucose to a final concentration of 20mM, as noted. All bacterial 
incubations (plates and overnight cultures) were conducted overnight (~16 hours) at 37°C. 
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Screening of transformants to identify desired clones was accomplished by sequencing 
of miniprep DNA and/or by colony PCR. Miniprep DNA was prepared with a Qiagen Tip 20 
Plasmid Prep Kit or a Qiagen QIAwell 8 Plasmid Prep Kit following the manufacturer's 
specifications, unless otherwise indicated. For colony PCR screening, individual colonies 
were picked from transformation plates and transferred into a well in a sterile flat-bottom 
96-well plate (Costar, Cambridge, MA) containing 100 (0,1 sterile H 2 0. One-third of the 
volume was transferred to a second plate and stored at 4°C. The original 96-well plate was 
microwaved for 5 minutes to disrupt the cells. 1 ^1 volume then was transferred to a PCR tube 
as template. 9 \il of a PCR master mix containing 1 nl 10X PCR buffer, 1 (il 2 mM dNTPs, 1 
|al (10 pmol) sense primer, 1 (10 pmol) anti-sense primer, 0.08 \xl AmpliTaq DNA 
polymerase (0.4 units), and 4.2 H 2 0 was added to the PCR tube. Reactions were generally 
amplified for 20-25 cycles of 94°C for 30 seconds, 50-60°C (depending on primer annealing 
temperatures) for 30 seconds and 72°C for 60 seconds. Primers were dependent on the insert 
and cycle conditions were modified based on primer annealing temperatures and the length of 
the expected product After cycling, approximately 1/3 of the. reaction volume was loaded on 
an agarose gel for analysis. Colonies containing desired clones were propagated from the 
transfer plate. 

Unless otherwise indicated, DNA sequencing was performed on an automated ABI 
Model 373A Stretch Sequencer (Perkin Elmer). Sequencing reactions were set up with 
reagents from a FS TACS Dye Term Ready Reaction Kit (Perkin Elmer) and 250-500 ng 
plasmid DNA according to the manufacturer's specifications. Reactions were processed on 
Centri-Sep columns (Princeton Separations, Adelphia, N.J.) prior to loading on the Sequencer. 
Sequence data was analyzed using Sequencher 3.0 (Gene Codes Corporation, Ann Arbor, MI) 
and GeneWorks 2.45 (Oxford Molecular Group, Inc., Campbell, CA). 

Example 2 

De-termination of the env Sequence of th e HTV-1 Groun O Isolate HAM112 

Viral RNA was extracted from culture supernatants of human peripheral blood 
mononuclear cells infected with the HTV-1 Group O isolate designated HAM1 12 (H. Hampl et 
al., Infection 23:369-370 [1995]) using a QIAamp Blood Kit (Qiagen) and the manufacturer's 
recommended procedure. RNA was eluted in a 50 \il volume of nuclease-free water (5Prime- 
3Prime, Inc., Boulder, CO) and stored at -70°C. The strategy for obtaining the env region 
sequence involved cDNA synthesis and PCR (nested) amplification of four overlapping env 
gene fragments. The amplified products were sequenced directly on an automated ABI Model 
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373 A Stretch Sequencer. Amplification reactions were carried out with GeneAmp RNA PCR 
and GeneAmp PCR Kits (Perkin Elmer) as outlined by the manufacturer. Oligonucleotide 
primer positions correspond to the HIV-1 ANT70 env sequence (G. Myers et al., eds., supra). 
The primers envlOR [nucleotide (nt) 791-772; SEQ ID NO:62], envl5R (nt 1592-1574; SEQ 
ID NO:63), env22R (nt 2321-2302; SEQ ID NO:64), env26R (nt 250-232 3' of env; SEQ ID 
NO:65) were used for cDNA synthesis of fragments 1-4, respectively. Reverse transcription 
reactions were incubated at 42°C for 30 minutes then at 99°C for 5 minutes. First-round PCR 
amplifications consisted of 30 cycles of 95°C for 30 seconds, 52°C for 30 seconds, and 72°C 
for 1 minute using the primer combinations: envlF (nt 184-166 5" of env; SEQ ID NO:66) and 
envlOR (SEQ ID NO:62), env7F (nt 564-586; SEQ ID NO:67) and envl5R (SEQ ID NO:63), 
envl2F (nt 1289-1308; SEQ ID NO:68) and env22R (SEQ ID NO:64), envl9F (nt 2020-2040; 
SEQ ID NO:69) and env26R (SEQ ID NO:65) for fragments 1 through 4, respectively. For 
the second round of amplification (nested PCR), 5 |xl of the respective first-round PCR 
reactions was used as template along with the primer combinations env2F (nt 37-15 5' of env; 
SEQ ID NO:70) and env9R (nt 740-721; SEQ ID NO:71), env8F (nt 631-650; SEQ ID NO:72) 
and envl4R (nt 1437-1416; SEQ ID NO:73), envl3F (nt 1333-1354; SEQ ID NO:74) and 
env21R (nt 2282-2265; SEQ ID NO:75), env20F (nt 2122-2141; SEQ ID NO:76) and env25R 
(nt 11 1-94 3' of env; SEQ ID NO:77) for fragments 1 through 4, respectively. Second-round 
amplification conditions were identical to those used for the first round. Fragments were 
agarose gel-purified and extracted with a Qjagen QIAEX II Gel Extraction Kit. Fragments 
were sequenced directly with the primers used for nested PCR along with primers env4F (SEQ 
ID NO:78) and env5R (SEQ ID NO:79) for fragment 1; primers envlOF (SEQ ID NO:80), 
envllF (SEQ ID NO:81), envllR (SEQ ID NO:82), envl2F (SEQ ID NO:68), and AG1 
(SEQ ID NO:87) for fragment 2; primers envl5F (SEQ ID NO:83) and envl9R (SEQ ID 
NO:84) for fragment 3; primers env22F (SEQ ID NO:85) and env24R (SEQ ID NO:86) for 
fragment 4. The deduced amino acid sequence of env from the HIV-1 Group O isolate 
HAM1 12 (SEQ ID NO:61) is presented in FIGURE 1. 

Example 3 

Construction of Synthetic HTV-1 Grou p O env gp!20 /gt>41 Genes 

FIGURE 2 depicts the strategy used to generate synthetic HIV-1 Group O env 
gpl20/gp41 gene constructs. The env gpl20/gp41 sequences were based on the HIV-1 Group 
O isolate HAM1 12 (SEQ ID NO:61). Determination of the env sequence of HAM1 12 is 
outlined in Example 2, hereinabove. Oligonucleotides were designed that encode the C- 
terminal 45 amino acids of the env gpl20 and 327 amino acids of env gp41 (nucleotide #1 is 
the first base of the first codon of gpl20 in the synthetic gene). The synthetic gene has a 26 
amino acid deletion (nucleotides 643 through 720), relative to the native HAM112 gp41, that 
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encompasses a highly hydrophobic (H) region (transmembrane region) of gp41. Thus, the 
full-length synthetic gp41 gene constructed is 327 amino acids. 

In the synthetic oligonucleotides, the native HIV-1 codons were altered to conform to 
E. coli codon bias in an effort to increase expression levels of the recombinant protein in E. 
coli. See, for example, M. Gouy and C. Gautier, Nucleic Acids Research 10:7055 (1982); H. 
Grosjean and W. Fiers, Gene 18:199 (1982); J. Watson et al. (eds.), Molecular Biology of the 
Gene, 4th Ed., Benjamin Kumming Publishing Co., pp.440 (1987). The gene construction 
strategy involved synthesis of a series of overlapping oligonucleotides with complementary 
ends (Osyn-A through Osyn-L, depicted as A through L). When annealed, the ends served as 
primers for the extension of the complementary strand. 

The fragments then were amplified by PCR. This process ("PCR knitting" of 
oligonucleotides) was reiterated to progressively enlarge the gene fragment. Oligonucleotide 
Osyn-5' was designed for cloning into the PL vector pKRR826. The expression vector, 
pKRR826, is a modified form of the lambda pL promoter vector pSDKR816, described in 
U.S. Serial No. 08/314,570, incorporated herein by reference. pKRR826 is a high copy 
number derivative of pBR322 that contains the temperature sensitive cl repressor gene (Benard 
et al., Gene 5:59 [1979]). However, pKRR826 lacks the translational terminator rrnBtl and 
has the lambda pL and lambda pR promoters in the reverse orientation, relative to pSDKR816. 
The polylinker region of pKRR826 contains Eco RI and Bam HI restriction enzyme sites but 
lacks an ATG start codon. Optimal expression is obtained when the 5' end of the gene insert 
(including an N-terminal methionine) is cloned into the EcoRI site. Osyn-5' was designed to 
contain an Eco RI restriction site for cloning and an ATG codon (methionine) to provide for 
proper translational initiation of the recombinant proteins. The anti-sense oligonucleotides 
Osyn-03' (SEQ ID NO:15), Osyn-P3' (SEQ ID NO:16), and Osyn-M (M) (SEQ ID NO: 14) 
each contain two sequential translational termination codons (TAA,TAG) and a Bam HI 
restriction site. When outside primers Osyn-5' (SEQ ID NO:l 1) and Osyn-M (M) (SEQ ID 
NO:14) were used, a full-length gp41 (327 amino acids) gene was synthesized (pGO-HPL; 
SEQ ID NO:52). Outside oligonucleotides Osyn-5' (SEQ ID NO: 11) and Osyn-03' (SEQ ID 
NO: 15) resulted in a truncated gp41 product of 199 amino acids (pGO-9PL; SEQ ID NO:48). 
Alternatively, outside oligonucleotides Osyn-5' (SEQ ID NO:ll) and Osyn-P3' (SEQ ID 
NO: 16) resulted in a truncated gp41 product 169 amino acids in length (pGO-8PL; SEQ ID 
NO:58). 

The synthetic genes also were expressed as CMP-KDO synthetase (CKS) fusion 
proteins. PCR-mediated transfer of the synthetic genes from pKRR826 into pJO200 
(described in U.S. Serial No. 572,822, and incorporated herein by reference) was 
accomplished with an alternative outside sense oligonucleotide PCR primer (5' end), Osyn- 
5'CKS (SEQ ID NO:25). Osyn-5'CKS contained an Eco RI restriction site and resulted in the 
in-frame fusion of the synthetic gene insert to CKS in the expression vector pJO200. The 3' 
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outside primers (antisense) Osyn-M (SEQ ID NO:14), Osyn-03' (SEQ ID NO:15) and Osyn- 
P3' (SEQ ID NO: 16) were used in combination with Osyn-5'CKS (SEQ ID NO:25) to 
generate pGO-1 1CKS (SEQ ID NO:54), pG0-9CKS (SEQ ID NO:50), and pGO-8 CKS 
(SEQ ID NO:60), respectively. These steps are detailed hereinbelow. 

A. PCR Knitting of Synthetic Olig onucleotides. 

Three PCR reactions (100 jul volume) were set up as follows: 

(1) Reaction IB: AmpliTaq DNA polymerase (2.5U) and IX buffer, along with 40uM 
of each dNTP (dATP, dCTP, dGTP, and dTTP), 25 pmol each of oligonucleotides Osyn-A 
(SEQ ID NO:3) and Osyn-D (SEQ ID NO:5), and 0.25 pmol each of oligonucleotides Osyn-B 
(SEQ ID NO: 17) and Osyn-C (SEQ ID NO:4); 

(2) Reaction 2A: UlTma DNA Polymerase (3U) and IX buffer along with 1.5mM 

Mga 2 , 40nM of each dNTP, 25pmol each of oligonucleotides Osyn-E (SEQ ID NO:6) and 
Osyn-H (SEQ ID NO:9), and 0.25 pmol each of oligonucleotides Osyn-F (SEQ ID NO:7) and 
Osyn-G (SEQ ID NO:8); and 

(3) Reaction 3 A: UlTma DNA Polymerase (3U) and IX buffer along with 1.5mM 

MgCl 2 , 40uM of each dNTP, 25pmol each of oligonucleotides Osyn-I (SEQ ID NO: 10) and 
Osyn-L (SEQ ID NO: 13), and 0.25 pmol each of oligonucleotides Osyn-J (SEQ ID NO: 18) 
and Osyn-K (SEQ ID NO: 12). 

Amplifications consisted of 20 cycles of 97°C for 30 seconds, 52°C for 30 seconds and 
72°C for 60 seconds. Reactions were then incubated at 72°C for 7 minutes and held at 4°C. 
PCR-derived products IB, 2 A and 3B were gel isolated on a 1% agarose gel. 

B. PCR Knitting of PCR Products From Reactio n 1B and Reaction 2A. 

A PCR reaction was set up with UlTma DNA Polymerase (3U) and IX buffer along 
with 1.5mM MgCl 2 , 40(^M of each dNTP, 24.4pmol of oligonucleotide Osyn-5' (SEQ ID 

NO: 11), 25 pmol of oligonucleotide Osyn-P3" (SEQ ID NO: 16), and -10 ng each of gel- 
isolated IB and 2A products from Example 3, Section 1A, hereinabove. Cycling conditions 
were the same as in Example 3, Section 1A. A second round of amplification was used to 
generate more of the desired product This was performed by making an UlTma mix as 
described hereinabove (100 |il reaction volume) with 49 pmol Osyn-5* (SEQ ID NO: 11), 50 
pmol Osyn-P3' (SEQ ID NO: 16) and 5 (Xl of the PCR product from the first round as template. 
These reactions were incubated at 94°C for 90 seconds, and then cycled as above (Section 3A). 
The Osyn-5'/Osyn-P3' PCR product was gel-isolated as described hereinabove. 
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C. Cloning of the Osyn-S'-Osyn-PS' PC R Product. 

The Osyn-5'-Osyn-P3' PCR product was digested with the restriction endonucleases 
Eco RI + Bam HI and ligated into the vector pKRR826 (described hereinabove) that had been 
digested with Eco RI + Bam HI and gel-isolated. The ligation product was used to transform 
DH5ot competent cells. The desired clone was identified by colony PCR using 
oligonucleotides pKRREcoRI Forward (SEQ ID NO:38) and pKRRBamHI Reverse (SEQ ID 
NO:39). Miniprep DNA was prepared from an overnight culture of pGO-8 candidate clone A2 
and the Osyn-5'-Osyn-P3' plasmid insert was sequenced with the oligonucleotide primers 
pKRREcoRI Forward (SEQ ID NO:38), pKRRBamHI Reverse (SEQ ID NO:39), 41sy-l 
(SEQ ID NO:44), and 41sy-2 (SEQ ID N0:41). 

D. Modification of pGO-8 Candidate Clone A2 . 

A 100 ul volume PCR reaction was set up with UlTma DNA Polymerase (3U) and IX 

buffer, along with 1.5mM MgCl 2 , 40|lM of each dNTP, 50pmol of oligonucleotides Osyn-5'- 

repair (SEQ ID NO:24), 50 pmol Osyn-P3' (SEQ ID NO: 16), and ~1 ng of pGO-8 candidate 
clone A2 miniprep DNA as template (obtained from the reactions set forth hereinabove). The 
reaction was incubated at 94°C for 90 seconds, and then amplified with 20 cycles of 94°C for 
30 seconds, 50°C for 30 seconds and 72°C for 60 seconds. The Osyn-5'-repair/Osyn-P3' 
PCR product then was gel isolated and digested with Eco RI + Bam HI. The digested product 
was ligated into Eco RI + Bam HI digested pKRR826 vector. The ligation product was used 
to transform DH5a competent cells. The desired clone was identified by colony PCR using 
oligonucleotides pKRREcoRI Forward (SEQ ID NO:38) and pKRRBamHI Reverse (SEQ ID 
NO:39). An overnight culture of pGO-8 candidate clone 6 was set up and a miniprep DNA was 
prepared. The Osyn-5'repair/Osyn-P3' plasmid insert was sequenced with the oligonucleotide 
primers pKRREcoRI Forward (SEQ ID NO:38), pKRRBamHI Reverse (SEQ ID NO:39), 
41sy-l (SEQ ID NO:44), and 41sy-2 (SEQ ID NO:41). Based on the sequencing results, 
pGO-8 candidate clone #6 was designated pGO-8PL/DHSoc. SEQ ID NO:57 presents the 
nucleotide sequence of the coding region. FIGURE 5 presents the amino acid sequence of the 
pGO-8PL recombinant protein (SEQ ID NO:58). The pGO-8PL recombinant protein consists 
of a N-terminal methionine, 45 amino acids of env gpl20 (FflV-1 Group O, HAM112 isolate), 
and 169 amino acids of env gp41 (HIV-1 Group O, HAM112 isolate). 

E. Construction of pGO-8CKS/XLl . 

pGO-8CKS/XLl (SEQ ID NO:59 presents the nucleotide sequence of the coding 
region) encodes the recombinant protein pGO-8CKS. FIGURE 6 presents the amino acid 
sequence of pGO-8CKS (SEQ ID NO:60). This protein consists of 246 amino acids of CKS/ 
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polylinker, 45 amino acids of env gpl20 (HIV-1 Group O, HAM1 12 isolate), and 169 amino 
acids of env gp41 (HIV-1 Group O, HAM112 isolate). The construction of pGO-8CKS/XLl 
was accomplished as follows. 

A PCR reaction (100 (xl volume) was set up with UlTma DNA Polymerase (3U) and 

IX buffer along with 1.5mM MgCl 2 , 40|lM of each dNTP, 50pmol of Osyn-5'CKS (SEQ ID 

NO:25), 50 pmol Osyn-P3' (SEQ ID NO: 16), and 1 ng pGO-8PL clone #6 miniprep DNA. 
The reaction was incubated at 94°C for 90 seconds then amplified with 25 cycles of 94°C for 30 
seconds; 55°C for 30 seconds; 72°C for 90 seconds. Then, the Osyn-5'CKS/Osyn-P3' PCR 
product was gel isolated. EcoR I + Bam HI digested the Osyn-5'CKS/Osyn-P3' PCR product 
and the vector pJO200. The digested pJO200 vector was gel isolated and ligated to digested 
Osyn-5'CKS/Osyn-P3' PCR product. XLl-Blue supercompetent cells were transformed with 
the ligation and plated on LB + ampicillin plates supplemented with 20 mM glucose. Colonies 
were restreaked for isolation on the same type of plates. An overnight culture of clone 
pGO-8CKS/XLl was grown in LB broth + 100|ig/ml carbenicillin (Sigma Chemical Co.)+ 20 
mM glucose (Sigma Chemical Co.). Frozen stocks (0.5 ml overnight culture + 0.5 ml glycerol) 
were made and DNA was prepared for sequence analysis. The following oligonucleotides 
were used as sequencing primers: CKS-1 (SEQ ID NO:30), CKS-2 (SEQ ID NO:31), CKS-3 
(SEQ ID NO:32), CKS-4 (SEQ ID NO:33), 43461 (SEQ ID NO:2), 43285 (SEQ ID NO:l), 
41sy-lB (SEQ ID NO:29), 41sy-2B (SEQ ID NO:34), CKS176.1 (SEQ ID NO:19), and 
CKS3583 (SEQ ID NO:20). 

R Construction of pGO-9PL/DH5a . 

FIGURES 3A through 3D and show a diagrammatic representation of the steps 
involved in construction of pGO-9PL/DH5cc. pGO-9PL/DH5a encodes the recombinant 
protein pGO-9PL. SEQ ID NO:47 present the nucleotide sequence of the coding region of 
pGO-9PL/DH5a. FIGURE 7 illustrates the amino acid sequence of the pGO-9PL recombinant 
protein (SEQ ID NO:48). This protein consists of an N-terminal methionine, 45 amino acids 
of env gpl20 (HIV-1 Group O, HAM112 isolate), and 199 amino acids of env gp41 ( HIV-1 
Group O, HAM1 12 isolate). Construction of pGO-9PL/DH5ce was accomplished as follows. 

Step 1: A 100 \i\ PCR reaction was set up with UlTma DNA Polymerase (3U) and IX 
buffer, along with 1.5mM MgCl 2 , 40|xM of each dNTP, 50pmol of Osyn-5' (SEQ ID NO: 1 1), 

50 pmol of Osyn-H (SEQ ID NO:9), and ~2 ng of pGO-8 candidate clone 6 miniprep DNA 
(obtained from Example 3, Section D hereinabove) as template. The reaction was incubated at 
94°C for 120 seconds, and then amplified with 8 cycles of 94°C for 30 seconds, 55°C for 30 
seconds and 72°C for 60 seconds. 
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Step 2: A 100 |al PCR reaction was set up with UlTma DNA Polymerase (3U) and IX 
buffer along with 1.5mM MgCl 2 , 40uM of each dNTP, 50pmol of Osyn-5 1 (SEQ ID NO:l 1), 

50 pmol Osyn-03' (SEQ ID NO: 15), and 10 |il of the PCR reaction from step 1 as template. 
The reaction was incubated at 94°C for 120 seconds then amplified with 18 cycles of 94°C for 
30 seconds, 55°C for 30 seconds, 72°C for 60 seconds, followed by incubation at 72°C for 5 
minutes. 

The Osyn-570syn-03' PCR product (2A/2B) then was gel-isolated and digested with 
Eco RI + Bam HI. The digested product was ligated into Eco RI + Bam HI digested 
pKRR826 vector. The ligation product next was used to transform DH5a competent cells. An 
overnight culture of pGO-9PL candidate clone 3 was set up and a miniprep DNA was 
prepared. The Osyn-5'/Osyn-03' plasmid insert was sequenced with the following 
oligonucleotides as primers: pKRREcoRl Forward (SEQ ID NO:38), pKRRBamffl Reverse 
(SEQ ID NO:39), 41sy-lC (SEQ ID NO:40), 41sy-2 (SEQ ID NO:41), 41sy-3 (SEQ ID 
NO:42) and 41sy-4 (SEQ ID NO:23). pGO-9PL clone #3 then was restreaked for isolation. 
An isolated colony was picked, an overnight culture of it was grown, and a frozen stock 
(0.5ml glycerol + 0.5ml overnight culture) was made. The stock was stored at -80°C. The 
sequence was confirmed using the primers indicated hereinabove, and this clone was 
designated as pGO-9PL/DH5oc (SEQ ID NO:47 presents the nucleotide sequence of the coding 
region, and SEQ ID NO:48 presents the amino acid sequence of coding region). pGO- 
9PL/DH5a was restreaked, an overnight culture was grown, and a miniprep DNA was 
prepared (this prep was designated as H5). 

fi. Construct! "" nfpGO-9CKS/XLl 

FIGURE 3 A through 3D show a diagrammatic representation of the steps involved in 
construction of pGO-9CKS/XLl. pGO-9CKS/XLl encodes the recombinant protein 
pGO-9CKS. FIGURE 8 presents the amino sequence of the pGO-9CKS recombinant protein 
(SEQ ID NO:50). This protein consists of 246 amino acids of CKS and polylinker followed 
by 45 amino acids of env gpl20 (HIV-1 Group O, HAM112 isolate), and 199 amino acids of 
env gp41 (HIV-1 Group O, HAM112 isolate). The construction of pGO-9CKS/XLl was 
accomplished as follows. 

Two PCR reactions (100 |xl volume) were set up with UlTma DNA Polymerase (3U) 

and IX buffer, along with 1.5mM MgCl 2 , 40|iM of each dNTP, 50pmol of Osyn-5'CKS 

(SEQ ID NO:25), 50 pmol Osyn-03' (SEQ ID NO: 15) and 1 ng pGO-9PL candidate clone 3 
miniprep DNA (obtained from Example 3, Section F, hereinabove). Each reaction was 
incubated at 94°C for 120 seconds, then amplified with 24 cycles of 94°C for 30 seconds, 55°C 
for 30 seconds, 72°C for 120 seconds, followed by incubation at 72°C for 5 minutes. The 
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Osyn-5'CKS/Osyn-03' PCR product then was gel isolated. The Osyn-5'CKS/Osyn-03' PCR 
product and the vector pJO200 was digested with EcoR I + Bam HI. The digested pJO200 
vector was gel isolated and ligated to the digested Osyn-5'CKS/Osyn-03' PCR product. XL1- 
Blue supercompetent cells were transformed with the ligation and plated on LB + ampicillin 
plates supplemented with 20 mM glucose. Colonies were restreaked for isolation on the same 
type of plates. An overnight culture of clone pGO-9CKS candidate clone 4 was grown in LB 
broth + 100 mg/ml carbenicillin (Sigma Chemical Co.)+ 20 mM glucose (Sigma Chemical 
Co.). Made frozen stocks (0.5 ml overnight culture + 0.5 ml glycerol) and prepared DNA for 
sequence analysis. The following oligonucleotides were used as sequencing primers: CKS-1 
(SEQ ID NO:30), CKS-2 (SEQ ID NO:31), CKS-3 (SEQ ID NO:32), CKS-4 (SEQ ID 
NO:33), 43461 (SEQ ID NO:2), 43285 (SEQ ID NO:l), 41sy-lB (SEQ ID NO:29), 41sy-2B 
(SEQ ID NO:34), 41sy-3B (SEQ ID NO:35), CKS 176.1 (SEQ ID NO: 19), CKS3583 (SEQ 
ID NO:20), and pTB-S8 (SEQ ID NO:28). Clone pGO-9CKS candidate clone 4 was 
designated as pGO-9CKS/XLl (SEQ ID NO:49 presents the nucleotide sequence of coding 
region, and SEQ ID NO:50 presents the amino acid sequence of coding region). 

H. Construction of Osvn I-M Fragment . 

The Osyn-O-M fragment was constructed as follows. A 100 |il PCR reaction was set 

up using AmpliTaq DNA Polymerase (2.5U), IX buffer, 50 |iM of each dNTP, 50pmol I- 
PCR (SEQ ID NO:26), 50 pmol Osyn-M (SEQ ID NO: 14) and 10 ng of gel-isolated PCR 
fragment 3A (Example 3, section A, hereinabove). The reaction was incubated at 95°C for 105 
seconds, and then it was amplified with 15 cycles of 95°C for 30 seconds, 55°C for 30 
seconds, 72°C for 60 seconds, and then it was held at 72°C for 7 minutes. The product, 
designated as Osyn I-M, was gel-isolated and cloned into the PCR H vector (TA Cloning Kit ; 
Invitrogen, San Diego, CA) following the manufacturer's recommended procedure. The 
resulting ligation product was used to transform DH5a competent cells. Plasmid miniprep 
DNA was generated from an overnight culture of clone IM-6, and the gene insert was 
sequenced with oligonucleotides 56759 (SEQUENCE ID NO: 45) and 55848 (SEQ ID 

NO:46). 

T. Synthesis and Knitting of PCR Fr apments I/6R and IM-6F. 
These procedures were performed as follows. 

Step 1: The following PCR reactions (100 jil volume) were set up: (a) I/6R with 
AmpliTaq DNA Polymerase (2.5U), IX buffer, 50 \sM of each dNTP, 50pmol I-PCR (SEQ 
ID NO:26), 50 pmol IM-6R (SEQ ID NO:22) and 281 ng of clone Evl-6 (obtained from 
Example 3, Section H) as template; (b) 6F/M with AmpliTaq DNA Polymerase (2.5U), IX 
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buffer, 50 of each dNTP, 50pmol IM-6F (SEQ ID NO:21), 50 pmol M-PCR (SEQ ID 
NO:27) and 281 ng of clone IM-6 (obtained from Example 3, Section H) as template. 

The reactions were incubated at 95°C for 105 seconds, and then amplified with 20 
cycles of 94°C for 15 seconds, 60°C for 30 seconds, 72°C for 60 seconds, then incubated at 
72°C for 7 minutes. The PCR products I/6R and 6F/M next were gel isolated following the 
procedures as described hereinabove. 

Step 2: A PCR reaction (100 |il volume) was set up with UlTma DNA Polymerase 

(3U) and IX buffer along with 1.5mM MgCl 2 , 40|lM of each dNTP, 50pmol of I-PCR (SEQ 

ID NO:26), 50 pmol M-PCR (SEQ ID NO:27), -50 ng I/6R, and ~20ng 6F/M. The reaction 
was incubated at 95°C for 105 seconds, and then it was amplified with 20 cycles of 94°C for 15 
seconds, 55°C for 30 seconds, 72°C for 60 seconds, followed by incubation at 72°C for 7 
minutes. The PCR product was processed on a Centri-sep column (Princeton Separations) 
following the manufacturer's instructions. 

I Constmctinn ofpGO-1 1PT,/DH5fx. 

FIGURES 4A through 4F show a diagrammatic representation of the steps involved in 
construction of pGO-1 lPL/DH5a. pGO-1 1PL/ DH5a encodes the recombinant protein pGO- 
1 1PL. FIGURE 9 presents the amino acid sequence of the pGO-1 1PL recombinant protein 
(SEQ ID NO:52). This protein consists of an N-terminal methionine, 45 amino acids of env 
gpl20 (HIV-1 Group O, HAM112 isolate), and 327 amino acids of env gp41 (fflV-1 Group 
O, HAM1 12 isolate). pGO-1 1PL/ DH5cc was constructed as follows. 

The final PCR product from Example 3, Section I and pGO-9PL vector (miniprep H5 
from Example 3, section F) were digested sequentially with Age I and Bam HI. The digested 
pGO-9PL was then treated with calf intestinal alkaline phosphatase (BRL Life Technologies) 
for 15 minutes at 37°C, phenol/chloroform extracted, and precipitated with NaOAc and EtOH. 
The vector (pGO-9PL) was subsequently gel-isolated. The digested pGO-9PL and the 
digested PCR product were ligated, and the ligation product was used to transform DH5a 
competent cells. Colonies were restreaked for isolation. Clone pGOl 1-4 then was identified 
and restreaked for isolation. An overnight culture of pGOl 1-4 was prepared in order to 
generate frozen stocks and perform miniprep DNA for sequencing. Clone pGOl 1-4 was 
sequenced with the following oligonucleotide primers: pKRREcoRl Forward (SEQ ID 
NO:38), pKRRBamHI Reverse (SEQ ID NO:39), 41sy-lC (SEQ ID NO:40), 41sy-2 (SEQ ID 
NO:41), 41sy-3 (SEQUENCE ID NO: 42), 41sy-4 (SEQ ID NO:23), 41sy-5B (SEQ ID 
NO:43), 41sy-5C (SEQ ID NO:36) and 41sy-6B (SEQ ID NO:37). Based on the sequencing 
results, this clone was designated as pGO-1 lPL/DH5oc (SEQ ID NO:51 presents the nucleotide 
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sequence of the coding region, and SEQ ID NO:52 presents the amino acid sequence of coding 
region). 

K. Construction nf pGQ-1 1CKS/XL1 . 

FIGURES 4A through 4G show a diagrammatic representation of the steps involved in 
construction of pGO- 1 1CKS/XL1. pGO- 1 1CKS/XL1 encodes the recombinant protein pGO- 
11CKS. FIGURE 10 shows the amino sequence of the pGO-llCKS recombinant protein 
(SEQ ID NO:54). This protein consists of 246 amino acids of CKS and polylinker followed 
by 45 amino acids of env gpl20 (HIV-1 Group O, HAM112 isolate), and 327 amino acids of 
env gp41 ( HIV-1 Group O, HAM112 isolate). pGO-llCKS/XLl was constructed as 
follows. 

A PCR reaction (100 [ll volume) was set up with UlTma DNA Polymerase (3U) and 

IX buffer along with 1.5mM MgCl 2 , 40^iM of each dNTP, 50pmol of Osyn-5'CKS (SEQ ID 

NO:25), 50 pmol Osyn-M (SEQ ID NO: 14), and 1 ng pGOl 1-4 (obtained from Example 3, 
Section J) as template. The reaction was incubated at 94°C for 105 seconds, and then amplified 
with 20 cycles of 94°C for 30 seconds, 55°C for 30 seconds, 72°C for 120 seconds, followed 
by incubation at 72°C for 7 minutes. The Osyn-5'CKS/Osyn-M PCR product was gel isolated. 
Next, the Osyn-5'CKS/Osyn-M PCR product and the vector pJO200 were EcoR I + Bam HI 
digested. The digested pJO200 vector was gel isolated. Overnight (16°C) ligations were set up 
with the digested PCR product. XLl-Blue supercompetent cells were transformed with the 
ligation and plated on LB + ampicillin plates supplemented with 20 mM glucose. Colonies 
were restreaked for isolation on the same plates. An overnight culture (LB medium + 
100|lg/ml carbenicillin + 20 mM glucose) of clone pGO-llCKS clone candidate 2 then was set 
up. Frozen stocks (0.5 ml 80% glycerol + 0.5 ml overnight culture) were made as well as 
miniprep DNA for sequencing. The following oligonucleotides were used as primers for 
sequence analysis: CKS-1 (SEQ ID NO:30), CKS-2 (SEQ ID NO:31), CKS-3 (SEQ ID 
NO:32), CKS-4 (SEQ ID NO:33), 43461 (SEQ ID NO:2), 43285 (SEQ ID NO:l), 41sy-lB 
(SEQ ID NO:29), 41sy-2B (SEQ ID NO:34), 41sy-3B (SEQ ID NO:35), 41sy-4 (SEQ ID 
NO:23), 41sy-5C (SEQ ID NO:36), 41sy-6B (SEQ ID NO:37), CKS176.1 (SEQ ID NO: 19), 
CKS3583 (SEQ ID NO:20), and pTB-S8 (SEQ ID NO:28). pGO-llCKS clone #2 was 
designated as pGO-1 1CKS/XL1. SEQ ID NO:53 presents the nucleotide sequence of the 
coding region of pGO-1 1CKS/XL1, and SEQ ID NO:54 presents the amino acid sequence of 
the coding region of pGO-1 1CKS/XL1. 
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Example 4 
Construction of pHTV210/XLl-Blue 

FIGURE 1 1 presents the amino acid sequence of the pHIV-210 recombinant protein 
(SEQ ID NO:55). This protein consists of 247 amino acids of CKS/linker sequences, 60 
amino acids from env gpl20 (#432-491; HIV-2 isolate D194.10), and 159 amino acids of env 
gp36 (#492-650; HIV-2 isolate D194.10). The construction of pfflV210/XLl-Blue was 
accomplished as follows. 

The genomic DNA of HIV-2 isolate D194.10 [H. Kuhnel et al., Nucleic Acids 
Research 18: 6142 (1990)] was cloned into the EMBL3 lambda cloning vector. See H. Kuhnel 
et al., Proc. Nat'l. Acad. Sci. USA 86: 2383-2387 (1989), and H. Kuhnel et al., Nucleic 
Acids Research 18: 6142 (1990), incorporated herein by reference. The lambda clone 
containing D194.10 (lambda A10) was obtained from Diagen Corporation (Diisseldorf, 
Germany). A PCR reaction (100 ul volume) was set up using AmpliTaq DNA polymerase 
(3.75 units), 200uM each dATP, dCTP, dGTP, and dTTP, 0.5 ug primer 3634 (SEQ ID 
NO:88; annealing to positions 7437-7455 on the HIV-2 isolate D194.10 (EMBL accession 
#X52223), 0.5 ug primer 3636 (SEQ ID NO:89, annealing to positions 8095-8077), IX PCR 
buffer, and 5 ul of the lambda A10 DNA diluted 1:50. The reaction was incubated 5 minutes at 
94°C then amplified with 35 cycles of 94°C for 1 minute, 45°C for 1 minute, 72°C for 2 
minutes; followed by an incubation at 72°C for 5 minutes. The PCR reaction was extracted 
with phenol/chloroform (Boehringer Mannheim Corporation, Indianapolis, IN) and the DNA 
was ethanol (AAPER Alcohol & Chemical Company, Shelbyville, KY) precipitated. The DNA 
was digested with EcoRI + Bam HI and gel purified on an 1.5% agarose gel (SeaKem GTG 
agarose, FMC Corporation, Rockland, Maine). The purified product was ligated into EcoRI + 
Bam HI digested pJO200 vector using 800 units of T4 DNA Hgase (New England BioLabs). 
XLl-Blue supercompetent cells (Stratagene) were transformed with 2 ul of the ligation as 
outlined by the manufacturer and plated on LB plates supplemented with ampiciUin (Sigma 
Chemical Company). Overnight cultures were established by inoculating single colonies into 
Superbroth II media (GIBCO BRL, Grand Island, NY) supplemented with 50 ug/ml ampicillin 
(Sigma) and 20mM glucose (Sigma). Frozen stocks were established by adding 0.3 ml of 
80% glycerol to 0.7 ml of overnight After mixing stocks were stored at -70°C. Miniprep 
DNA was prepared from the overnight cultures using the alkaline lysis method followed by 
PEG precipitation. Sequence reactions were performed with a 7-deaza-dGTP Reagent Kit with 
Sequenase Version 2.0 (United States Biochemical Corporation, Cleveland, OH) as outlined 
by the manufacturer. Reactions were run on 6% acrylamide gels (GIBCO BRL Gel-Mix 6) 
using the IBI gel apparatus as recommended by the manufacturer. Based on sequencing 
results, pHTV-210 clone #7 was designated as pHIV-210. The amino acid sequence of the 
pHIV-210 coding region is presented as SEQ ID NO:55. 
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Example 5 

Growth And Induction of E. coli Strains with 
HTV-1 Group O Recombinant gp41 Antigen Construct 

Overnight seed cultures of pGO-9CKS/XLl and pGO-1 1CKS/XL1 were prepared in 
500 ml sterile Excell Terrific Broth (available from Sigma Chemical Corp., St. Louis Mo.) 
supplemented with 100 (Xg/ml sodium ampicillin, and placed in a shaking orbital incubator at 
32°C or 37°C. One hundred milliliter (100 ul) inocula from seed cultures were transferred to 
flasks containing 1 liter sterile Excell Terrific Broth supplemented with 100 ng/ml sodium 
ampicillin. Cultures were incubated at 37°C until the culture(s) reached mid-logarithmic growth 
and then induced with 1 mM ITPG (isopropylthiogalactoside) for 3 hours at 37°C. (In the case 
of PL vector constructs, cultures were incubated at 32°C until the culture(s) reached mid- 
logarithmic growth and then induced for 3 hours by shifting the temperature of the culture(s) to 
42°C.) After the induction period, cells were pelleted by centrifugation and harvested 
following standard procedures. Pelleted cells were stored at -70°C until further processed. 

Example 6 

Isolation and Solubilization of HTV-1 Group O Recom binant. gn41 Antigen Produced as 

Insoluble Inclusion Bodies in E. coli 

Frozen cells obtained from Example 5 were resuspended by homogenization in cold 
lysis buffer comprising 50 mM Tris pH 8, 10 mM Na EDTA , 150 mM NaCl, 8% (w/v) 
sucrose, 5% Triton X-100® (v/v), 1 mM PMSF and 1 |lM pepstatin A. Lysozyme was added 
to the homogenates at a concentration of 1.3 mg per gram of cells processed, and the resultant 
mixture was incubated for 30 minutes on ice to lyse the cells. Inclusion bodies were separated 
from soluble proteins by centrifugation. These pelleted inclusion bodies were washed and 
pelleted sequentially in (1) Lysis Buffer; (2) 10 mM Na EDTA pH 8, 30% (w/v) sucrose; and 
(3) water. The washed inclusion bodies were resuspended in 50 mM Tris pH 8, 10 mM Na 
EDTA, 150 mM NaCl and 3 M urea, and incubated on ice for 1 hour. The inclusion bodies 
then were separated from the solubilized proteins by centrifugation. The pelleted inclusion 
bodies were fully solubilized in 7 M guanidine-HCl, 50 mM Tris pH 8, 0.1% (v/v) beta- 
mercaptoethanol (BME) overnight at 4°C. The solubilized recombinant antigens were clarified 
by centrifugation, passed through a 0.2 ^m filter and stored at <-20°C until purified by 
chromatography. 
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Example 7 

Purification of Recombinant HIV- 1 Group O gp41 Antigen bv Chromatography 

Solubilized HTV-1 Group O recombinant gp41 antigens obtained from Example 6 were 
5 purified by a two-step method, as follows. Guanidine-HCl extracts of insoluble antigens were 
purified by size exclusion chromatography on a Sephacryl S-300 column equilibrated with 50 
mM Tris pH 8, 8 M Urea and 0.1% BME (v/v). SDS-polyacrylamide electrophoresis was 
used to analyze fractions. Fractions containing the recombinant gp41 antigen were pooled and 
then concentrated by ultrafiltration. The recombinant antigen concentrate was treated with 4% 
10 SDS (w/v) and 5% BME (w/v) at room temperature for 3 hours. SDS treated antigen was 

further purified by size exclusion chromatography on a Sephacryl S-300 column equilibrated 
with 25 mM Tris pH 8, 0.15 M Nad, 0.1% v/v BME, 0.1% SDS (w/v). SDS- 
polyacrylamide electrophoresis was used to analyze the fractions. Fractions containing 
purified recombinant antigen were pooled, passed through a 0.2 |jm filter and stored at -70° C. 

15 

Example 8 
Preparation of HIV- 1 Group M antigen 

Cells containing the plasmid pTB319 were grown and induced as described in Example 
20 5. Cells were lysed and inclusion bodies were processed essentially as described in Example 5 
of U.S. Patent No: 5,124,255, incorporated herein by reference. The pellet material was 
subsequently solubilized in SDS, Phosphate, pH 6.8 and then subjected to chromatography on 
an S-300 column. 

25 Example 9 

Preparation of HTV-2 antigen 

pfflV-210/XLl-Blue cells (Example 4, hereinabove) were grown and induced as 
described in Example 5. Cells were lysed with a buffer containing phosphate, MgCl 2 > Na 

30 EDTA, Triton X-100® pH 7.4 supplemented with Benzonase, Lysozyme, and PMSF. 
Inclusion bodies were separated from soluble proteins by centrifugation. The pellet was 
washed sequentially with: distilled H 2 0; Triton X-100®, deoxycholate, NaCl, Phosphate pH 
7.0; 50 mM Phosphate, pH 7.0; urea, SDS in phosphate, pH 7.0 + BME. Proteins were 
solubilized in SDS, phosphate, pH 7.0 and BME then subjected to chromatography on an 

35 S300 column. 
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Example 10 

One-Step Immunochromatographic Assay for Simu ltaneous Detection 
and Differentiation of HIV-1 Group M. HIV-1 Group O and HIV-2 

5 A. Reagent preparation 

1. A selenium (Se) colloid suspension was prepared substantially as follows: Se0 2 
was dissolved in water to a concentration of 0.0625 gm/ml. Ascorbate then was dissolved in 
water to a concentration of 0.32 gm/ml and heated in a 70°C water bath for 24 hours. The 
ascorbate solution then was diluted to 0.0065 gm/ml in water. The Se0 2 solution was quickly 

10 added to the diluted ascorbate solution and incubated at 42°C. Incubation was ended after a 
minimum of 42 hours when the absorbance maximum exceeded 30 at a wavelength between 
542 nm and 588 nm. The colloid suspension was cooled to 2-8 °C, then stored. Selenium 
colloid suspension is available from Abbott Laboratories, Abbott Park, Illinois (Code 25001). 

2. Selenium colloid/antibody conjugates were prepared as follows. The selenium 
15 colloid suspension was concentrated to an absorbance of 25 (OD 500-570) in distilled water. 

Then, 1M MOPS was added to a final concentration of 10 mM pH 7.2. Goat antibodies 
specific for human IgG Fc region (or other species of antibody specific for human IgG Fc 
region) were diluted to a concentration of 0.75 mg/ml with 50 mM Phosphate buffer, and the 
resultant antibody preparation then was added with mixing to the selenium colloid suspension 

20 prepared as described hereinabove, to a final antibody concentration of 75|ag/ml. Stirring was 
continued for 40 minutes. Then, 1% (by weight) bovine serum albumin (BSA) was added to 
the solution, and the selenium colloid/antibody conjugate solution was stirred for an additional 
15 minutes and centrifuged at 5000 x g for 90 minutes. Following this, 90% of the 
supernatant was removed, and the pellet was resuspended with the remaining supernatant. 

25 Immediately prior to coating this selenium-IgG conjugate to a glass fiber pad, it was diluted 

1:10 with conjugate diluent (1% [by weight] casein, 0.1% [weight] Triton X-405®, and 50 mM 
Tris, pH 8.2). 

3. Procedural control reagent was prepared as a mixture of HIV-1 (group M), HIV-1 
(group O), and HIV-2 positive sera, and is utilized on a separate strip device as a positive 

30 control of the assay. 

4. Negative control reagent used was normal human utilized on a separate test device 

as a negative control of the assay. 

B. Application pad preparation. 
35 The application pad material comprises resin bonded glass fiber paper (Lydall). 

Approximately 0. 1 ml of the prepared conjugate (described in preceding paragraph 2) is applied 
to the application pad. 
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C. Chromatographic Material Preparation . 

All reagents are applied to a nitrocellulose membrane by charge and deflect reagent 
jetting. The nitrocellulose is supported by a MYLAR® membrane that is coated with a pressure 
sensitive adhesive. 

5 The test sample capture reagents were prepared by (a) diluting the specific antigen 

prepared as described hereinabove to a concentration of 0.5 mg/ml in jetting diluent (100 mM 
Tris, pH 7.6 with 1% sucrose (by weight), 0.9% NaCl and 5 |LLg/ml fluorescein) for HIV-1 
group O capture reagent (pGO-9/CKS, SEQ ID NO:50), (b) for HIV-1 group M, subgroup B 
capture reagent (pTB319, SEQ ID NO:56), and (c) for HIV-2 capture reagent (pfflV-210, SEQ 

10 ID NO:55). 0.098 |ul of a first capture reagent (reagent HIV-1 group M subgroup B; SEQ ID 
NO:56) was applied to the strip at the designated capture location and constituted one patient 
capture site. Likewise, 0.098 \xl of a second capture reagent (reagent HIV-1 group O; SEQ ID 
NO:50) was applied to the strip at the designated capture location and constituted one patient 
capture site, and 0.098 |Lil of a third capture reagent (reagent HIV-2; SEQ ID NO:55) was 

15 applied to the strip at the designated capture location and constituted one patient capture site. 

D. Ranid assay for the presence of antibodies to HIV . 

A rapid assay for the presence of antibodies to HIV in test samples serum, whole 
blood, saliva, and urine samples was performed as follows. In a 1.5 ml Eppendorf tube, 5 (il 

20 of serum and 600 |il of sample elution buffer (SEB) (containing 50 mM Tris, 1% BSA (w/v), 
0.4% Triton X-405® (v/v), 1.5% Casein (w/v), 3% Bovine IgG (w/v), 4% E. coli lysate (v/v), 
[pH 8.2]) was mixed. Four drops of this mixture was applied to the sample well of the STAR 
housing. Next, 1 jlxI of serum or whole blood was added to 100 |il of SEB in a well of a 
microtiter plate, and the nitrocellulose strip was added in the well. Following this, 1 |il of 

25 serum or whole blood was spotted in the test device of the invention's sample well directly and 
4 drops of SEB was added. When testing saliva, 50 or 75 |il of saliva was added to 50 |il or 
25 |il of SEB, respectively, in a well of a microtiter plate, and the nitrocellulose test strip then 
was added to the well. When testing urine, 50 |ll of urine was added to 50 ul of SEB in a well 
of a microtiter plate, and the nitrocellulose test strip was added in the well. Alternatively, 100 

30 [il of urine was used in the well of a microtiter plate, and the nitrocellulose test strip was added, 
without using SEB. 

The IgG in the sample was bound by the selenium-goat anti-human IgG colloid in the 
conjugate pad, and the complexes were chromatographed along the length of the nitrocellulose 
membrane test strips on which the three recombinant antigens pGO-9 CKS SEQ ID NO:50), 
35 pTB319 (HIV-1 group M (subgroup B), SEQ ID NO:56) and pfflV210 (HIV-2, SEQ ID 

NO:55) previously were applied at a concentration of 1 mg/ml using a biodot machine, which 
provided positive displacement dispensing using precise drop sizes. The test device then was 
incubated at room temperature for two minutes, and the results were read visually. 
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E. Spiked Whole Blood Assay , 

In a 1.5 ml Eppendorf tube, the equivalent of 1 |il blood from either confirmed positive 
HIV-1 group O, HIV-1 group M or HIV-2, or confirmed negative for HIV-1 group O, HIV-1 
5 group M or HIV-2 whole blood test sample was added to 5 |al of a confirmed negative HIV- 1 
group O, HIV-1 group M or HIV-2 serum along with 100 |il of SEB, and mixed. This 
mixture was applied to the sample well of the test device of the invention. 

The IgG in the sample was bound by the selenium-goat anti-human IgG colloid in the 
conjugate pad, and the complexes were chromatographed along the length of the nitrocellulose 
10 membrane test strips on which the three recombinant antigens pGO-9 CKS SEQ ID NO:50), 
pTB319 (HIV-1 group M (subgroup B), SEQ ID NO:56) and pfflV210 (HIV-2, SEQ ID 
NO:55) previously were applied at a concentration of 1 mg/ml using a biodot machine, which 
provided positive displacement dispensing using precise drop sizes. The test device then was 
incubated at room temperature for two minutes, and the results were read visually. 

15 

R Results. 

If antibody to antigen 1 was present in the test sample, a visible reaction was indicated 
in the capture zone area of antigen 1 and in the assay completion zone, and not in the zones of 
antigen 2 or antigen 3. If antibody to antigen 2 was present in the test sample, a visible 

20 reaction was indicated in the capture zone area of antigen 2 and in the assay completion zone, 
and not in the zones of antigen 1 or antigen 3. If antibody to antigen 3 was present in the test 
sample, a visible reaction was indicated in the capture zone area of antigen 3 and in the assay 
completion zone, and not in the zones of antigen 1 or antigen 2. Also, a negative control 
should be non-reactive (show no visible reaction) in the zones of antigen 1, antigen 2 and 

25 antigen 3, but should be reactive in the assay completion zone. A positive control (known 
reactive antibody to antigen 1, 2 and/or 3) should be reactive in the zone of the appropriate 
antigen to which it specifically binds in an antigen/antibody reaction. A result was considered 
invalid when a positive reaction occurred in one of the antigen capture zones but not in the 
assay completion zone, and the test was repeated. 

30 (i) Assaying for Antibodies in Blood, Urine and Saliva. The blood, urine, and saliva 

of three patients (identified by patient numbers 0109, 4068, and 4475) were tested on 
nitrocellulose solid phase devices of the invention as described herein and following the assay 
protocol as set forth hereinabove. Each blood and urine test sample of each patient 0 109, 4068 
and 4475 was reactive with antigen 1 (pTB319; SEQ ID NO:56). The saliva test sample of 

35 patients 4068 and 4475 also were reactive with antigen 1, while patient 0109's saliva test 

sample was non-reactive in the test device of the invention. The saliva test sample of patient 
0109 was later retested by a standard EIA and confirmed non-reactive for antibodies to HIV-1 
gp41, indicating that the results obtained for the saliva test sample of patient 0109 were valid. 
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(ii) Assaying Negative Samples for HTV Antibodies. Two negative sera and two 
negative whole blood test samples, each spiked with the same two negative sera, were tested. 
Samples contained no antibodies specific for the relevant antigens and the test samples were 
negative after assay on the test (Le. no reactivity, as indicated by no visible bar signifying a 

5 reaction in either position O, M or 2). Test sample was present in each test device, as indicated 
by a positive reaction bar in the test sample reactivity zone. 

(iii) Assaying for HIV- 1 Group M Antibody. Five HIV-1 Group M sera and five 
whole blood samples spiked with the HIV-1 Group M positive sera were tested using ten 
devices. HIV-1 Group M samples were seen to contain antibodies specific for HIV-1 Group 

10 M antigen (pTB319) as shown by development of a reaction line at the HIV-1 Group M antigen 
zone, and visible reaction lines could be seen in the assay completion zone of nine out of 10 
test devices. Although a band was present in one particular test device in the capture zone for 
HIV-1 group M antibody, test sample did not reach the assay completion zone and, thus, the 
assay needed to be repeated for this particular sample. No cross-reactivity was observed with 

15 the capture reagents for HIV group O and HIV-2. 

(iv) Assaying for HIV-1 Group O Antibodies. Two confirmed positive HIV-1 Group 
O sera and two whole blood test samples spiked with HIV-1 Group O sera were tested using 
an additional four devices. The HIV-1 Group O samples were found to contain antibodies 
specific for HIV-1 Group O antigen as indicated by a positive bar result in the HIV-1 Group O 

20 antigen capture zone area, with reaction lines visible in the assay completion zone of each 

device. No cross-reaction with HIV-1 group M or HIV-2 capture antigens (no visible bar) was 
observed. 

(v) Assaying for HIV-2 Antibodies. Ten further test devices were used to test five 
HIV-2 confirmed positive sera and whole blood spiked with the 5 HIV-2 sera. The HIV-2 

25 samples were found to contain antibodies specific for HIV-2 antigen (pHIV210) as shown by 
reaction bars at the HIV-2 antigen zone. No reaction was observed between these test samples 
and the HIV-1 Group O or HIV-1 Group M antigens; visible reaction lines were seen in the 
assay completion zone of each device. 

(vi) Assaying HIV-1 Group M, HIV-1 Group O, HIV-2 and Negative Samples. Four 
30 final devices were used to test an HIV-1 Group M-positive test sample, an HTV-1 Group O- 

positive test sample, an HIV-2-positive test sample and a negative control sample. The 
negative test serum did not react with any antigen in the antigen capture zone; the HIV-1 Group 
M-positive test sample was reactive only with the HIV-1 Group M antigen; the HIV-1 Group 
O-positive test sample was reactive only with the HIV-1 Group O antigen; and the HIV-2- 
35 positive test sample was reactive only with the HIV-2 antigen. Visible reaction lines were seen 
in the assay completion zone of each device. 

The five HIV-1 group M and the two HIV-1 group O test samples used were confirmed 
seropositive samples which had been previously tested using a commercially-available enzyme 
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immunoassay (Abbott #3A77) and had been PCR amplified, sequenced and subtyped based 
on phylogenetic analysis. The five HIV-2 samples used were seropositive using the same EIA 
and were confirmed as HIV-2-positive samples using an HIV-2 Western blot test (Sanofi). 

5 Example 11 

Construction of Synthetic HIV-1 Group M and HIV-1 Grou p O Hybrid Genes 

A. Modification of dTB3 19 

The plasmid pTB319 (U.S. Patent No. 5,124,255, incorporated herein by reference) 

10 encodes a truncated gp41 recombinant protein due to a one base deletion within the synthetic 
HIV-1 Group M gp41 gene resulting in a frame-shift. In order to facilitate the generation of 
HIV-1 Group M and Group O hybrid gene constructs, site-specific mutagenesis was used to 
eliminate the frame-shift within the gp41 coding region in pTB319. This was accomplished by 
sequentially digesting the plasmid pTB319 with the restriction endonucleases Rsr II and Bst 

15 XL The synthetic oligonucleotides pTB3 19+A (SEQ ID NO:98) and pTB3 19+T (SEQ ID 
NO:99) were annealed and ligated into the Rsr II and Bst XI digested pTB319. The ligation 
product was used to transform supercompetent XLl-Blue cells and the cells were plated on LB 
agar plates supplemented with 150 |ig/ml ampicillin. Colony PCR was used to identify 
correctly modified clones using the primer combinations pTB-S4 (SEQ ID NO: 100) / pTB-S7 

20 (SEQ ID NO:101) and pTB-S4 (SEQUENCE ID NO:100) / 63168 (SEQ ID NO:121). 

Overnight cultures were established for candidate clones in LB broth supplemented with 3 mM 
glucose and 200 jag/ml ampicillin for preparation of miniprep DNA. The entire coding region 
was sequenced using the oligonucleotide primers: 43461 (SEQ ID NO:2), 43285 (SEQ ID 
NO:l), CKS-1 (SEQ ID NO:30), CKS-3 (SEQ ID NO:32)> pTB-Sl (SEQ ID NO:102), pTB- 

25 S2 (SEQ ID NO: 103), pTB-S3 (SEQ ID NO: 104), pTB-S4 (SEQ ID NO: 100), pTB-S5 (SEQ 
ID NO: 105), pTB-S6 (SEQ ID NO:106), pTB-S7 (SEQ ID NO:101), and pTB~S8 (SEQ ID 
NO:28). Based on sequencing results, clone pTB319+A-#31 (pGMcks-1) has the desired 
coding region sequence. This clone was subsequently designated as pGM-lCKS/XLl (SEQ 
ID NO: 107 presents the nucleotide sequence of the coding region). Figure 12 presents the 

30 amino acid sequence of the pGM- 1CKS recombinant protein (SEQ ID NO: 108). 

B. Construction of pGO-12CKS/XLl 

pGO-12CKS/XLl encodes the recombinant protein pGO-12CKS, the amino acid 
sequence of which (SEQ ID NO:91) is shown in Figure 13. This protein consists of 250 
35 amino acids of CKS/polylinker fused to 42 amino acids of env gpl20 (HIV-1 Group M, 

HXB2R isolate), 200 amino acids of env gp41 (HIV-1 Group M, HXB2R isolate), 45 amino 
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acids of env gpl20 (HIV-1 Group O, HAM112 isolate), and 199 amino acids of env gp41 
(fflV-1 Group O, HAM112 isolate). pGO-12CKS/XLl was constructed as follows: 

A PCR reaction (100 |il volume) was set up with UlTma DNA Polymerase (3U) and 
IX buffer along with 1.5mM MgCl2, 40|iM of each dNTP, 50pmol of pTB/O-5' (SEQ ID 
NO:109), 50 pmol pGO-9/Kpn (SEQ ID NO:110), and 1 ng pG0-9PL DNA (miniprep H5; 
obtained from Example 3, Section F above) as template. The reaction was incubated at 94°C 
for 105 seconds then amplified with 22 cycles of 94°C for 30 seconds, 55°C for 30 seconds, 
and 72°C for 75 seconds, followed by incubation at 72°C for 5 minutes. The pTB/O-5' / 
pGO-9/Kpn PCR product was isolated on gel. The pTB/O-5' / pGO-9/Kpn PCR product and 
pGM-lCKS plasmid (described in Section A hereinabove) were digested sequentially with Asp 
718 (Boehringer Mannheim Biochemicals) and Bst XL The digested vector was then treated 
with calf intestinal alkaline phosphatase (Boehringer Mannheim Biochemicals), extracted with 
phenol / chloroform, and precipitated with ethanol. The digested PCR product was purified on 
a Centri-Sep column (Princeton Separations). Digested PCR product was ligated into the 
digested and phosphatased pGM-lCKS vector overnight at 16°C. XLl-Blue supercompetent 
cells were transformed with the ligation product and plated on LB + ampicillin plates 
supplemented with 20 mM glucose. Colonies were restreaked for isolation on the same type of 
plates. An overnight culture (LB medium + lOO^ig/ml carbenicillin + 20 mM glucose) of clone 
pGO-12CKS clone #1 was set up. Frozen stocks (0.5 ml 80% glycerol + 0.5 ml overnight 
culture) were made and miniprep DNA was prepared for sequencing. The following 
oligonucleotides were used as primers for sequence analysis: CKS-1 (SEQ ID NO:30), CKS- 
2 (SEQ ID NO:31), CKS-3 (SEQ ID NO:32), CKS-4 (SEQ ID NO:33), CKS 176.1 (SEQ ID 
NO: 19), 3962 (SEQ ID NO: 111), 3965 (SEQ ID NO: 113), pTB-S2 (SEQ ID NO: 103), pTB- 
S3 (SEQ ID NO:104), pTB-S4 (SEQ ID NO:100), pTB-S5 (SEQ ID NO:105), syl20-Sl 
(SEQ ID NO:112), 41sy-lB (SEQ ID NO:29), 41sy-2B (SEQ ID NO:34), 41sy-4 (SEQ ID 
NO:23), pTB-S8 (SEQ ID NO:28). Based on the results of the sequence analysis, pGO- 
12CKS candidate clone #1 was designated as pGO-12CKS/XLl. (SEQ ID NO:90 presents the 
nucleotide sequence of the coding region, and SEQ ID NO:91 presents the encoded amino acid 
sequence.) 

C. Construction nfpGO-13CKS/XLl 

pGO-13CKS/XLl encodes the recombinant protein pGO-13CKS, the amino acid 
sequence of which (SEQ ID NO:93) is shown in Figure 14. This protein consists of 250 
amino acids of CKS/polylinker fused to 42 amino acids of env gpl20 (HIV-1 Group M, 
HXB2R isolate), 200 amino acids of env gp41 (HIV-1 Group M, HXB2R isolate), 45 amino 
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acids of env gpl20 (HIV-1 Group O, HAM1 12 isolate), and 169 amino acids of env gp41 
(fflV-1 Group O, HAM112 isolate). pGO-13CKS/XLl was constructed as follows: 

A PCR reaction (100 |il volume) was set up with UlTma DNA Polymerase (3U) and 
IX buffer along with 1.5mM MgCl2, 40pM of each dNTP, 50pmol of pTB/O-5' (SEQ ID 

5 NO: 109), 50 pmol pGO-8/Kpn (SEQ ID NO: 1 14), and 1 ng pG0-9PL DNA (miniprep H5; 
obtained from Example 3, Section F hereinabove) as template. The reaction was incubated at 
94°C for 105 seconds then amplified with 22 cycles of 94°C for 30 seconds, 55°C for 30 
seconds, and 72°C for 75 seconds, followed by incubation at 72°C for 5 minutes. The 
pTB/O-5' / pGO-8/Kpn PCR product was isolated on gel. The pTB/O-5' / pGO-8/Kpn PCR 

10 product and pGM- 1 CKS plasmid (described in Section A above) were digested sequentially 
with Asp 718 (Boehringer Mannheim Biochemicals) and Bst XI. The digested vector was then 
treated with calf intestinal alkaline phosphatase (Boehringer Mannheim Biochemicals), 
extracted with phenol / chloroform, and precipitated with ethanol. The digested PCR product 
was purified on a Centri-Sep column (Princeton Separations). Digested PCR product was 

15 ligated into the digested and phosphatased pGM-lCKS vector overnight at 16°C. XLl-Blue 
supercompetent cells were transformed with the ligation product and plated on LB + ampicillin 
plates supplemented with 20 mM glucose. Colonies were restreaked for isolation on the same 
type of plates. An overnight culture (LB medium + 100(Xg/ml carbenicillin + 20 mM glucose) 
of clone pGO-13CKS clone #1 was set up. Frozen stocks (0.5 ml 80% glycerol + 0.5 ml 

20 overnight culture) were made and miniprep DNA was prepared for for sequencing. The 
following oligonucleotides were used as primers for sequence analysis: CKS-1 (SEQ ID 
NO:30), CKS-2 (SEQ ID NO:31), CKS-3 (SEQ ID NO:32), CKS-4 (SEQ ID NO:33), 43461 
(SEQ ID NO:2), 43285 (SEQ ID NO:l), pTB-Sl (SEQ ID NO: 102), pTB-S2 (SEQ ID 
NO:103), pTB-S3 (SEQ ID NO:104), pTB-S4 (SEQ ID NO:100), pTB-S5 (SEQ ID NO:105), 

25 syl20-Sl (SEQ ID NO:112), 41sy-lB (SEQ ID NO:29), 41sy-2B (SEQ ID NO:34), 41sy-4 
(SEQ ID NO:23), pTB-S8 (SEQ ID NO:28). Based on the results of the sequence analysis, 
pGO-13CKS candidate clone #1 was designated as pGO-13CKS/XLl. (SEQ ID NO:92 
presents the nucleotide sequence of the coding region, and SEQ ID NO:93 presents the 
encoded amino acid sequence.) 

30 

D Construction of pOO-14PL/DH5a 

pGO-14PL/DH5a encodes the recombinant protein pGO-14PL, the amino acid 
sequence of which (SEQ ID NO:95) is shown in Figure 15. This protein consists of an 
N-terminal methionine followed by 45 amino acids of env gpl20 (HIV-1 Group O, F1AM1 12 
35 isolate), 200 amino acids of env gp41 (HIV-1 Group O, HAM1 12 isolate) fused to 42 amino 
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acids of env gpl20 (HIV-1 Group M, HXB2R isolate), and 200 amino acids of env gp41 
(HIV-1 Group M, HXB2R isolate). pGO-14PL/DH5a was constructed as follows: 

A PCR reaction (100 \il volume) was set up with UlTma DNA Polymerase (3U) and 
IX buffer along with 1.5mM MgCl2, 40(lM of each dNTP, 50pmol of pTB/Age5' (SEQ ID 

5 NO: 1 15), 50 pmol pGO/B-3' (SEQ ID NO: 116), and 1 ng pGM-lCKS DNA (miniprep of 
pTB319+A-#31; obtained from Section A above) as template. The reaction was incubated at 
95°C for 30 seconds then amplified with 22 cycles of 94°C for 30 seconds, 55°C for 30 
seconds, and 72°C for 60 seconds, followed by incubation at 72°C for 5 minutes. The 
pTB/Age5' / pGO/B-3' PCR product was isolated on gel. The pTB/Age5' / pGO/B-3' PCR 

10 product and pGO-9PL plasmid (obtained from Example 3, Section F hereinabove) were 

digested sequentially with Age I and Bam HI. The digested vector was then treated with calf 
intestinal alkaline phosphatase (Boehringer Mannheim Biochemicals), extracted with phenol / 
chloroform, and precipitated with ethanol. The digested PCR product was purified on a 
Centri-Sep column (Princeton Separations). Digested PCR product was ligated into the 

15 digested and phosphatased pGM-lCKS vector overnight at 16°C. DH5a competent cells were 

transformed with the ligation product and plated on LB + ampicillin (150 |Xg/ml) plates. 
Colonies were analyzed for the presence of the proper insert by colony PCR using the vector 
primers pKRR EcoRl forward (SEQ ID NO:38) and pKRR BamHl reverse (SEQ ID NO:39). 
Colonies containing candidate clones were restreaked for isolation on the same type of plates. 

20 Overnight cultures (LB medium + 100|lg/ml carbenicillin) were set up to generate frozen stocks 
and miniprep DNA. Frozen stocks (0.5 ml 80% glycerol + 0.5 ml overnight culture) were 
made and miniprep DNA was prepared for sequencing. The following oligonucleotides were 
used as primers for sequence analysis: pTB-Sl (SEQ ID NO: 102), pTB-S2 (SEQ ID 
NO:103), pTB-S3 (SEQ ID NO:104), pTB-S4 (SEQ ID NO:100), pTB-S5 (SEQ ID NO:105), 

25 41sy-lC (SEQ ID NO:40), 41sy-2 (SEQ ID NO:41), 41sy-3 (SEQ ID NO:42), 41sy-4 (SEQ 
ID NO:23), pKRREcoRl forward (SEQ ID NO:38), pKRR BamHl reverse (SEQ ID NO:39). 
Based on the results of the sequence analysis, pGO-14PL candidate clone #11 was designated 
as pGO-14PL/DH5a. (SEQ ID NO:94 presents the nucleotide sequence of the coding region, 
and SEQ ID NO:95 presents the encoded amino acid sequence.) 
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Example 12 

Construction of a HIV-1 Group O env gp!20 / gp41 Synthetic Gene with a 
Second Copy of the gp41 Immunodominant Region (TDK) Fused to the C-terminus 

5 A. Construction of PGO-15CKS/XL1 

pGO-15CKS/XLl encodes the recombinant protein pGO-15CKS, the amino acid 
sequence of which (SEQ ID NO:97) is shown in Figure 16. This protein consists of 246 
amino acids of CKS/polylinker fused to 45 amino acids of env gpl20 (HIV-1 Group O, 
HAM112 isolate), 199 amino acids of env gp41 (HIV-1 Group O, HAM112 isolate), followed 
10 by a 4 amino acid linker (Gly, Gly, Gly, Ser) and 32 amino acids encompassing the IDR 

region of env gp41 (HIV-1 Group O, HAM112 isolate). pGO-15CKS/XLl was constructed 
as follows: 

The plasmid pGO-1 1CKS propagated in XLl-Blue cells (obtained from Example 3, 
Section K) was digested sequentially with Age I and Bam HI, extracted with phenol / 

15 chloroform, and precipitated with ethanol. The synthetic oligonucleotides synIDR#2-A (SEQ 
ID NO: 1 17) and synIDR#2-B (SEQ ID NO:l 18) were kinased with polynucleotide kinase 
(Boehringer Mannheim Biochemicals) following the manufacturer's recommended procedure. 
The kinased oligonucleotides were annealed and the duplex ligated to the digested (Age I + 
Bam HI) pGO-1 1CKS vector. Supercompetent XLl-Blue cells were transformed with the 

20 ligation product, and the cells were plated on LB plates supplemented with 150 |ig/ml 

ampicillin and incubated overnight. Colony PCR (primers 41sy-lB SEQ ID NO:29 and pTB- 
S8 SEQ ID NO:28) was used to identify candidate clones. Colonies were restreaked for 
isolation on LB plates supplemented with 150 (Xg/ml ampicillin. Overnight cultures of the 
candidate clones were established in 2X LB broth (Life Technologies, Inc.) supplemented with 

25 100 mg/ml carbenicillin and 20 mM glucose (Sigma Chemical Co.). Miniprep DNA was 
prepared from the overnight cultures using a Promega 373 DNA isolation kit (Promega 
Corporation, Madison, WI) following the manufacturer's recommended procedure. The 
overnight cultures were also used to establish frozen stocks. Cells were pelleted and 
resuspended in 2X LB broth with 20% glycerol (J.T. Baker, Phillipsburg, NT) and frozen at 

30 -70°C. The following oligonucleotide primers were used for sequence analysis: CKS-1 (SEQ 
ID NO:30), CKS-3 (SEQ ID NO:32), 43285 (SEQ ED NO:l), 43461 (SEQ ID NO:2), 41sy- 
1B (SEQ ID NO:29), 41sy-2B (SEQ ID NO:34), 41sy-3B (SEQ ID NO:35), 41sy-4 (SEQ ID 
NO:23), and CKS3583 (SEQ ID NO:20). Based on sequencing results, candidate clone pGO- 
15CKS-48 was designated as pGO-15CKS/XLl. (SEQ ID NO:96 presents the nucleotide 

35 sequence of the coding region, and SEQ ID NO:97 presents the encoded amino acid sequence.) 
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B. Construction of pGO-1 5PL/DH5a. 

pGO-15PL/DH5a encodes the recombinant protein pGO-15PL, the amino acid 
sequence of which (SEQ ID NO: 120) is shown in Figure 17. This protein consists of an 
N-terminal methionine followed by 45 amino acids of env gpl20 (fflV-1 Group O, HAM1 12 
isolate), 199 amino acids of env gp41 (HIV-1 Group O, HAM1 12 isolate), a 4 amino acid 
linker (Gly, Gly, Gly, Ser) and 32 amino acids encompassing the IDR region of env gp41 
(HIV-1 Group O, HAM112 isolate). pGO-15PL/DH5 was constructed as follows: 

A PCR reaction (100 (Xl volume) was set up with AmpliTaq DNA Polymerase (2.5U) 
and IX buffer along with 40uM of each dNTP, 50pmol of 41sy-3B (SEQ ID NO:35), 50 pmol 
pTB-S8 (SEQ ID NO:28), and 1 ng pGO-15CKS DNA (miniprep of candidate clone pGO- 
15CKS-48; obtained from Section A above) as template. The reaction was incubated at 95°C 
for 30 seconds, then amplified with 35 cycles of 94<>C for 20 seconds, 50°C for 30 seconds, 
and 72°C for 60 seconds, followed by incubation at 72°C for 7 minutes. The amplified 
product was purified using a QIAquick PCR Purification Kit (Qiagen). The purified 41sy-3B / 
pTB-S8 amplification product was digested sequentially with Age I and Bam HI, then ligated 
to pGO-9PL (Age I + Bam HI digested / phosphatased vector prep from Example 3, Section J 
above). Competent DH5a cells were transformed using the ligation product and plated on LB 
plates supplemented with 150 |Hg/ml ampicillin. Candidate clones were identified by colony 
PCR with the primers 41sy-3 (SEQ ID NO:42) and pKRR BamHI reverse (SEQ ID NO:39), 
followed by digestion of the PCR product with Age I. Candidate clone #4 was restreaked for 
isolation. A culture of clone #4 was established in 2X LB broth (Life Technologies) 
supplemented with 100 (ig/ml carbenicillin (Sigma Chemical Co.) and incubated at 34°C 
overnight. Miniprep DNA was prepared from part of the overnight culture using a Promega 
373 DNA Isolation Kit (Promega Corp.) as outlined by the manufacturer. Frozen stocks were 
established by pelleting the remaining overnight culture and resuspending the cells in Terrific 
Broth with 20% glycerol (J.T. Baker Co.) and freezing at -70°C. The following 
oligonucleotide primers were used for sequence analysis: pKRR EcoRl forward (SEQ ID 
NO:38), pKRR BamHI reverse (SEQ ID NO:39), 41sy-lC (SEQ ID NO:40), 41sy-2 (SEQ ID 
NO:41), 41sy-3 (SEQ ID NO:42), 41sy-3B (SEQ ID NO:35) and 41sy-4 (SEQ ID NO:23). 
Based on sequencing results, candidate pGO-15PL clone #4 was designated as 
pGO-15PL/DH5a. (SEQ ID NO: 119 presents the nucleotide sequence of the coding region, 
and SEQ ID NO: 120 presents the encoded amino acid sequence.) 
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Example 13 

Preparation and Purification of HTV-1 Group O Recombinant gp41 Antigens t>GO-8 PL, 
pGO-9 PL. PGO-12CKS. pGO-U PL and pGO~15CKS 

5 The above antigens were prepared by growing and inducing E. coli strains containing 

the respective fflV-1 Group O recombinant gp41 antigen constructs as described in Example 5. 
The resulting frozen cells were resuspended by homogenization in cold lysis buffer comprising 
50 mM Tris pH 8, 10 mM Na EDTA , 150 mM NaCl , 8% (w/v) sucrose, 5% Triton X-100 ® 
(v/v), 1 mM PMSF and 1 |iM pepstatin A. Lysozyme was added to the homogenates at a 

10 concentration of 1.3 mg per gram of cells processed, and incubated for 30 minutes on ice to 
lyse the cells. Inclusion bodies were separated from soluble proteins by centrifugation. These 
pelleted inclusion bodies were washed and pelleted sequentially in 1) Lysis Buffer; 2) 10 mM 
Na EDTA pH 8, 30% (w/v) sucrose; and 3) water. The washed inclusion bodies were 
resuspended in 50 mM Tris pH 8, 10 mM Na EDTA, 150 mM NaCl and 3 M urea, and 

15 incubated on ice for 1 hour. The inclusion bodies then were separated from the solubilized 
proteins by centrifugation. The pelleted inclusion bodies were fully solubilized in 7 M 
guanidine-HCl, 50 mM Tris pH 8, 0.1% (v/v) beta-mercaptoethanol (BME) overnight at 4°C 
The solubilized recombinant antigen(s) were clarified by centrifugation, passed through a 0.2 
|xm filter. The solubilized gp41 antigen(s) were precipitated from the 7 M Guanidine-HCl 

20 solution by dilution (1:7) with water to a final concentration of 1 M Guanidine-HCl. After 
incubation at 4° C for 30 minutes, the precipitated proteins were centrifuged and resolubilized 
in 50 mM Tris pH 8, 9 M Urea, 0. 1% BME (v/v) overnight at 4° C 

Solubilized HIV-1 Group O recombinant gp41 antigens were next purified as follows: 
The recombinant antigens were first purified by anion and/or cation exchange chromatography 

25 using Q-Sepharose (Pharmacia) or S-Sepharose (Pharmacia) columns. The solubilized gp41 
antigen solutions were loaded onto either a Q-Sepharose or S-Sepharose column that had been 
previously equilibrated with 50 mM Tris pH 8, 8M Urea, 0.1% BME (v/v). The gp41 
antigens either (1) passed though the column directly and were collected in the void volume or 
(2) were bound to the column matrix. If adsorbed, the gp41 antigens were eluted from the 

30 columns by a 0-1M NaCl gradient. SDS-polyacrylamide gel electrophoresis was used to 
analyze fractions from the Q-Sepharose or S-Sepharose columns. Fractions containing the 
recombinant gp41 antigens were pooled and then concentrated by ultrafiltration. The 
recombinant antigen concentrates were treated with 4% SDS (w/v) and 5% BME (w/v) at room 
temperature for three hours. SDS treated antigens were further purified by size exclusion 

35 chromatography on a Sephacryl S-300 (Pharmacia) column equilibrated with 25 mM Tris pH 
8, 0.15 M NaCl, 0.1% v/v BME, 0.1% SDS (w/v). SDS-polyacrylamide gel electrophoresis 
was used to analyze the fractions from the S-300 column. Fractions containing purified 
recombinant antigens were pooled, passed through a 0.2 |am filter and stored at -70° C 
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Example 14 

Test of Recombinant Antigen Reactivity with HIV-1 Group M and Group O Samples 

5 A. Bead Coating 

In order to examine the reactivity of recombinant HIV-1 antigens, purified 
recombinants were coated on quarter inch polystyrene beads. These antigen coated beads were 
used in a series of capture assays to access reactivity to both HIV-1 Group M and Group O 
samples. 

10 Recombinant antigens were coated on quarter inch beads at 0.5 (Xg/ml in PBS. The 

following recombinant antigens were coated: pTB319 (Group M), pGO-9/CKS, pGO-1 1/PL, 
pGO-12/CKS, pGO-14/PL and pGO-15/CKS (all Group O). 

The procedure for coating the recombinant antigens on the beads is as follows: For 
each antigen, 35.5 gm. (-250) of beads, (Abbott Laboratories code 93-2556, lot 6840M100), 

15 were washed in 15% N-propanol in water for 30 minutes at 40°C. All incubations and washes 
were done in small brown glass jars on a shaker platform. The N-propanol solution was 
aspirated off, 58.25 ml of antigen solution was added, and the beads were incubated for two 
hours at 40°C. The antigen solution was aspirated off, and 60 ml of a 0.1% Triton X-100 
solution in PBS was added for 30 minutes at 40°C. The beads were then washed with 60 ml of 

20 PBS two times and incubated with 60 ml of 2% BS A in PBS for 30 minutes at 40°C. The BSA 
was aspirated and the beads were washed again in PBS. The beads were then incubated with 
60 ml 0.5% sucrose in PBS for 15 minutes at room temperature. After 15 minutes, the sucrose 
was aspirated and the beads were allowed to air dry. Coated beads were strored in 
polypropylene bottles with a desiccant at 4°C 

25 

B. Assays 

Recombinant antigen coated beads were tested for reactivity against a variety of samples 
using the Abbott Laboratories 3A1 1 kit (first generation, indirect assay format). Samples were 
diluted and added to wells in polystyrene trays. Beads were added and the trays were 

30 incubated at 40°C for 1 hour. The trays were washed with water in an Abbott Laboratories 
QUICKWASH device. Next the kit conjugate, an anti-human IgG-Horseraddish Peroxidase, 
was added and the trays were again incubated at 40°C for one hour. The trays were again 
washed and 300 \il of substrate solution, (1.28 mg/ml o-Phenylenediamine»HCl in Citrate- 
Phosphate buffer containing 0.02% Hydrogen Peroxide), was added to each well for 30 

35 minutes at room temperature. 1 ml of IN sulfuric acid was added to stop the reaction, and the 
trays were read in an Abbott QUANTUM spectrophotometer. 

The samples used for this study were Normal human plasma, (Abbott Laboratories 
code 99800, lot 17535M400), used as a negative control; HIVPL-31 (Group M positive sera), 
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and the following Group O positive sera: 14283, 189404, 193Ha, 14791, 267Ha and ESP-1. 
All samples except the Normal human plasma control were run at three dilutions; 1:1,000, 
1 : 10,000 and 1 : 100,000 in the kit specimen diluent. Each dilution of each sample was run in 
duplicate against each of the six beads, and the results of each dilution were averaged and 
5 plotted for each bead. 

C. Results 

The results of the above tests, shown in Figures 18-23, demonstrate the improvements 
in sensitivity and selectivity available by use of the recombinant antigens of the present 

10 invention. The bead coated with the HIV-1 Group M recombinant antigen (pTB3 19) detected 
the Group M serum sample, but failed to detect all but one of the Group O samples. The beads 
coated with only HIV-1 Group O recombinant antigens (pGO-9/CKS, pGO-1 1/PL, and PGO- 
15/CKS) detected the Group O serum samples, but showed lower sensitivity in detection of the 
HIV-1 Group M sample. Beads that were coated with hybrid Group M and Group O 

15 recombinant antigens (pGO- 12/CKS, and pGO-14/PL) were able to detect both HIV-1 Group 
M- and Group O-positive samples. Lastly, pGO-15/CKS, which has an additional sequence 
representing the Group O immunodominant region of gp41 linked by recombinant means to the 
carboxy end of the protein, showed greater reactivity to low-titer Group O samples. 

20 Example 15 

Examination of Assay Sensitivity for HIV-1 Group O -Tnfected Samples 
Using Group O Recombinant Antigens nGO-9CKS and pGO-llCKS 



A. Assays 

25 In order to evaluate the performance in immunoassays of antigen constructs of the 

present invention, recombinant antigens pGO-9CKS and pGO-1 1CKS were incorporated into 
four HIV-l/HIV-2 immunoassays containing HIV-1 Group M (subtype B) reagents. The 
constructs were tested using one bead assay (Assay 1) and 3 automated microparticle-based 
assays (Assays 2-4). In all cases, the reactivity of HIV-1 Group O-infected specimens was 

30 assessed with (format 2) and without (format 1) incorporation of the HIV-1 group O 

recombinants. The coated beads/microparticles were reacted with multiple dilutions of the 
following HIV-1 Group O-positive human sera: ESP1, 189404, 193Ha, 341 Ha, 2156 and 
ABB 9/96. 

For Assay 1, purified pGO-1 1CKS was incorporated into a commercially-available 
35 bead-based assay by coating the antigen construct onto quarter-inch polystyrene beads. The 
coated beads were reacted with a range of dilutions of HIV-1 Group O-positive human sera, 
washed, and then reacted with purified pGO-9CKS conjugated to horseradish peroxidase. 
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After washing/separation of bound from unbound pGO-9CKS conjugate, substrate was added 
and the assay was completed as indicated in Example 14. 

For Assay 2, purified pGO-UCKS was incorporated into a second commercially- 
available assay by coating the antigen construct onto microparticles. The coated microparticles 
were reacted with the same range of dilutions of HIV-1 Group O-positive human sera utilized 
in Assay L The microparticles were then washed and subsequently reacted with biotinylated 
pG0~9CKS. After further washing, the microparticles were reacted with a polyclonal anti- 
biotin antibody conjugated to alkaline phosphatase. The assay signal was developed by 
addition of the substrate methylumbelliferyl phosphate. 

For Assay 3, purified pGO-1 1CKS was incorporated into a third commercially- 
available assay by coating the antigen construct on microparticles. The coated microparticles 
were again reacted with the same range of dilutions of fflV-1 Group O-positive human sera 
utilized in Assay 1. Next, the microparticles were washed and then reacted with biotinylated 
pGO-9CKS. After washing, the microparticles were reacted with an anti-biotin antibody 
conjugated to acridinium as the signal-generating compound. 

For Assay 4, purified pGO-1 1CKS was incorporated into a developmental assay by 
coating the antigen construct onto magnetic microparticles. As in Assay 1, the coated 
microparticles were reacted with a range of dilutions of HIV-1 Group O-positive human sera, 
washed, and subsequently reacted with pGO-9CKS conjugated to acridinium. 

B. Results 

The results of the above tests are presented in Tables 1 and 2 below, in which the data 
are presented as signal/cutoff (S/CO) ratios. Format 1 refers to the conventional assay without 
the antigen constructs of the present invention, while Format 2 refers to the assay 
supplemented with the HIV-1 group O constructs. 

From these data, it can be seen that the addition of the HIV-1 Group O recombinants 
resulted in a significant enhancement of assay sensitivity for the HIV-1 Group O-infected sera 
at all of the dilutions tested. For example, in the case of Assay 1 and sample 193Ha a S/CO 
ratio of 7.14 was obtained at a 1:10 dilution using Format 1, while a similar S/CO (7.22) was 
obtained at a 160-fold greater dilution (1:1600) using Format 2. This trend was maintained 
across all of the tested assay platforms. The utility of the group O recombinants was 
particularity evident for sample 2156, which tested negative (S/CO < 1) in all 4 assays prior to 
the addition of the group O recombinants. With the addition of the HIV-1 Group O constructs, 
however, this sample 2156 tested positive in all four assays at a 1:400 dilution. In Assay 1, 
2156 was still positive at a dilution of 1:5000. Addition of the recombinant reagents pGO- 
9CKS and pGO-1 1CKS was thus seen to provide a substantially better sensitivity for HIV-1 
Group O-infected sera when using the above direct-format immunoassays. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION 

(i) APPLICANT: Hackett, John R. Jr. 

Yamaguchi, Julie 
Golden, Alan M. 
Brennan, Catherine A. 
Hickman, Robert K. 
Devare, Sushi 1 G. 

(ii) TITLE OF THE INVENTION: Recombinant Antigens Useful In 1 
Detection And Differentiation Of Antibodies To HIV 

(iii) NUMBER OF SEQUENCES : 121 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Abbott Laboratories 

(B) STREET: 100 Abbott Park Road 

(C) CITY: Abbott Park 

(D) STATE: IL 

(E) COUNTRY: USA 

(F) ZIP : 60064-3500 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch Diskette, 1.4MB, DOS- formatted 

(B) COMPUTER: Power Macintosh 7100/66 

(C) OPERATING SYSTEM: MacOS 7.1.2 (DOS emulation) 

(D) SOFTWARE: WordPerfect 3.1 (saved as Text Export, ASCII 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C ) CLASSIFICATION : 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Danckers, Andreas M. 

(B) REGISTRATION NUMBER: 32,652 

(C) REFERENCE /DOCKET NUMBER: 6 165. US. 01 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 847-937-9803 

(B) TELEFAX: 847-938-2623 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
GAGATCTTCA GGGGTATCC 



(2) INFORMATION FOR SEQ ID NO : 2 : 



-58- 



6165.US.O! 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
5 (D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
GGATCATCGG TTCATCACCC 

10 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 114 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CATGATCGGT GGTGACATGA AAGACATCTG GCGTAACGAA CTGTTCAAAT ACAAAGTTGT 
TCGTGTTAAA CCGTTCTCTG TTGCTCCGAC CCCGATCGCT CGTCCGGTTA TCGG 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 111 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

35 GCAGGTTCCA CTATGGGTGC TGCAGCTACC GCTCTGACCG TACAGACCCA CTCTGTTATC 6 
AAAGGTATCG TACAGCAGCA CGACAACCTG CTGCGTGCAA TCCAGGCACA G H 



40 (2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

50 AGCTGCTGGT TCTGGATCAG GGTTTCCAGT GCCAGCAGAC GAGCACGCAG CTGACGGATA 60 
CCCCATACAG ACAGACGCAG CAGTTCCTGC TGTGCCTGGA TTGCACGCAG 110 



(2) INFORMATION FOR SEQ ID NO : 6 : 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
60 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
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CTGATCCAGA ACCAGCAGCT GCTGAACCTG TGGGGCTGCA AAGGTCGTCT GATCTGCTAC 60 
ACCTCCGTTA AATGGAACGA AACCTGGCGT AACACCACCA ACATCAACCA G HI 

5 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 117 base pairs 
\0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

CTGAACCTGA GCTTTCTGGA TTTCTTCGTA GATGGTGGAA GAAACGTTGT CGATCTGCTG 
GTCCCATTCC TGCCAGGTCA GGTTACCCCA GATCTGGTTG ATGTTGGTGG TGTTACG 



60 
117 



20 (2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

30 TCCAGAAAGC TCAGGTTCAG CAGGAACAGA ACGAAAAAAA ACTGCTGGAA C TGGACGAAT 
GGGCTTCTCT GTGGAACTGG CTGGACATCA CCAAATGGCT G 



(2) INFORMATION FOR SEQ ID NO : 9 : 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

ACCTTCACCG GTACGACCCG GAGTTTCAGC TTCAGACTGC TGACGGGTCG GGATCTGCAG 60 
45 GGACAGCGGC TGGTAGCCCT GACGGATGTT ACGCAGCCAT TTGGTGATGT CCAG 114 



(2) INFORMATION FOR SEQ ID NO: 10: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

^ 5 ( X i) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CGGGTCGTAC CGGTGAAGGT GGTGGTGACG AAGGCCGTCC GCGTC TGATC CCGTCTCCGC 
AGGGTTTCCT GCCGCTGCTG TACACCGACC TGCGTACCAT CATCCTG 

60 

(2) INFORMATION FOR SEQ ID NO: 11: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
C TAC AAGAAT TCCATGATCG GTGGTGACAT G 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

GTCTGTGGAT TC TGGGTC AG AAAATCATCG ACGCTTGCCG TATCTGCGCT GCTGTTATCC 
ACTACTGGCT GCAGGAACTG CAGAAATCCG CTACCTCCCT GATCGACAC 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 : 

GCGAACACGA CGCGGGATGT TCAGGATACC ACGACCCAGA CGC TGG AT AC CACGGATGAT 
GTCGTCAGTC CAGTTAGCAA CTGCAACAGC GAAGGTGTCG ATCAGGGAGG TAGC 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATAGTAGGAT CCTATTACAG CAGAGAGCGT TCGAAGCCCT GGCGAACACG ACGCGGGATG 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
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ATAGTAGGAT CCTATTATTC ACCGGTACGA CCCGGAGTTT CAG 43 

5 (2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
15 ATAGTAGGAT CCTATTACAG CCATTTGGTG ATGTCCAG 3 8 

(2) INFORMATION FOR SEQ ID NO: 17: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



25 



30 



45 



55 



60 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GC AC CC AT AG TGGAACCTGC TGCAGACAGA ACGCCCAGGA ACAGCATACC CAGACCTACA 60 
GCACGTTTTT CACGGTGGGT GCCAGTACCG ATAACCGGAC GAGCGA 10 6 



(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 108 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



CTGACCCAGA ATCCACAGAC CCAGACGCAG GTGAGAGATA ACAGTCTGAG TACCAGAGAT 60 
CAGGTTAGAC AGCAGGTGGT AGGAC C AC AG GATGATGGTA CGCAGGTC 10 8 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GCAGCTTCGT GTTCTGTGGT ACGGCG 2 6 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
CGTAACGGTA CGACACTCC 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
CCGCTACCTC CCTGATCGAC ACCTTC 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GAAGGTGTCG ATCAGGGAGG TAGCGG 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GATGTCCAGC CAGTTCCAC 



(2) INFORMATION FOR SEQ ID NO; 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

CTACAAGAAT TCCATGATCG GTGGTGACAT GAAAGACATC TGGCGTAACG AACTGTTCAA 
ATAC 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
CTACAAGAAT TCTATCGGTG GTGACATGAA AGAC 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
CGGGTCGTAC CGGTGAAGGT 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
ATAGTAGGAT C CTATTAC AG CAG 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
GCCGGAAGCG AGAAGAATC 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
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TATCGTACAG CAGCAGGAC 



(2) INFORMATION FOR SEQ ID NO : 3 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
CCCATTAATG TGAGTTAGCT C 



(2) INFORMATION FOR SEQ ID NO : 3 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 
CCTGACGAAT GATTGTCGCA 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 
ATTCAGCGAC GACACGGTG 



(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 
GTATCCACAC CTGTGCCA 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 



AGAGTGGGTC TGTACGGTC 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
AATGGGCTTC TCTGTGGAAC 



(2) INFORMATION FOR SEQ ID NO : 3 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
CTGTCTAACC TGATCTCTGG 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
ACGCAGGTGA GAGATAACAG 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GTGATACGAA ACGAAGCATT GG 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GCGATATAGG CGCCAGCAAC C 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
CTCTGTTATC AAAGGTATCG T 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
AGCAGACGAG CACGCAGC 



(2) INFORMATION FOR SEQ ID NO : 42 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TO POLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 42 
TTCAGCAGGA ACAGAACG 



(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 
TCCGCGTCTG ATCCCGTC 



(2) INFORMATION FOR SEQ ID NO: 44: 



-67- 



6165.US.O! 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

10 CCAGGCACAG CAGGAAC 17 



(2) INFORMATION FOR SEQ ID NO: 45: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



25 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
AC AC T ATAGA ATACTCAAGC 2 0 

(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
TAATACGACT CACTATAGGG 20 



40 



(2) INFORMATION FOR SEQ ID NO: 47: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 741 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 47 : 

ATGATCGGTG GTGACATGAA AGACATCTGG CGTAACGAAC TGTTCAAATA CAAAGTTGTT 60 

50 CGTGTTAAAC CGTTCTCTGT TGCTCCGACC CCGATCGCTC GTCCGGTTAT CGGTACTGGC 12 0 

ACCCACCGTG AAAAACGTGC TGTAGGTCTG GGTATGCTGT TCCTGGGCGT TCTGTCTGCA 180 

GCAGGTTCCA CTATGGGTGC TGCAGCTACC GCTCTGACCG TACAGACCCA CTCTGTTATC 240 

AAAGGTATCG TACAGCAGCA GGACAACCTG CTGCGTGCAA TCCAGGCACA GCAGGAACTG 3 00 

CTGCGTCTGT CTGTATGGGG TATCCGTCAG CTGCGTGCTC GTCTGCTGGC AC TGGAAAC C 3 60 

55 CTGATCCAGA ACCAGCAGCT GCTGAACCTG TGGGGCTGCA AAGGTCGTCT GATCTGCTAC 42 0 

ACCTCCGTTA AATGGAACGA AACCTGGCGT AACACCACCA ACATCAACCA GATCTGGGGT 480 

AACCTGACCT GGCAGGAATG GGACCAGCAG ATCGACAACG TTTCTTCCAC CATCTACGAA 540 

GAAATCCAGA AAGCTCAGGT TCAGCAGGAA CAGAACGAAA AAAAACTGCT GGAAC TGG AC 60 0 

GAATGGGCTT CTCTGTGGAA CTGGCTGGAC ATCACCAAAT GGCTGCGTAA CATCCGTCAG 660 

60 GGCTACCAGC CGCTGTCCCT GC AGATCCCG ACCCGTCAGC AGTCTGAAGC TGAAACTCCG 72 0 

GGTCGTACCG GTGAATAATA G 741 
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(2) INFORMATION FOR SEQ ID NO: 48: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
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(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: 

ATGAGTTTTG TGGTCATTAT TCCCGCGCGC 
TTGGTTGATA TTAACGGCAA ACCCATGATT 
GGTGCCGAGC GCATCATCGT GGCAACCGAT 
GCTGGCGGTG AAGTATGTAT GACGCGCGCC 
GAAGTTGTCG AAAAATGCGC ATTCAGCGAC 



SEQ ID NO: 49: 

TACGCGTCGA CGCGTCTGCC CGGTAAAC C A 

GTTCATGTTC TTGAACGCGC GCGTGAATCA 

CATGAGGATG TTGCCCGCGC CGTTGAAGCC 

GATCATCAGT CAGGAACAGA ACGTCTGGCG 

GACACGGTGA TCGTTAATGT GCAGGGTGAT 



-69- 



6165.US.O! 



GAACCGATGA TCCCTGCGAC AATCATTCGT 
GTGGGTATGA CGACTCTGGC GGTGCCAATC 
GCGGTGAAAG TGGTTCTCGA CGCTGAAGGG 
CCTTGGGATC GTGATCGTTT TGCAGAAGGC 
5 CATCTTGGTA TTTATGGCTA CCGTGCAGGC 
AGTCCGTTAG AACACATCGA AATGTTAGAG 
ATCCATGTTG CTGTTGCTCA GGAAGTTCCT 
GACCCGTCGA CGAATTCTAT CGGTGGTGAC 
AAATACAAAG TTGTTCGTGT TAAACCGTTC 

10 GTTATCGGTA CTGGCACCCA CCGTGAAAAA 
GGCGTTCTGT CTGCAGCAGG TTCCACTATG 
ACCCACTCTG TTATCAAAGG TATCGTACAG 
GCACAGCAGG AACTGCTGCG TCTGTCTGTA 
CTGGCACTGG AAACCCTGAT CCAGAACCAG 

15 CGTCTGATCT GCTACACCTC CGTTAAATGG 
AACCAGATCT GGGGTAACCT GACCTGGCAG 
TCCACCATCT ACGAAGAAAT CC AGAAAGCT 
CTGCTGGAAC TGGACGAATG GGCTTCTCTG 
CGTAACATCC GTCAGGGCTA CCAGCCGCTG 

20 GAAGCTGAAA CTCCGGGTCG TAC CGGTGAA 



CAGGTTGCTG ATAACCTCGC TCAGCGTCAG 3 60 
CACAATGCGG AAGAAGCGTT TAACCCGAAT 420 
TATGCACTGT ACTTCTCTCG CGCCACCATT 480 
CTTGAAACCG TTGGCGATAA CTTCCTGCGT 540 
TTTATCCGTC GTTACGTCAA CTGGCAGCCA 60 0 

CAGCTTCGTG TTC TGTGGTA CGGCGAAAAA 660 
GGCACAGGTG TGGATACCCC TGAAGATCTC 720 
ATGAAAGACA TCTGGCGTAA CGAACTGTTC 780 
TCTGTTGCTC CGACCCCGAT CGCTCGTCCG 840 
CGTGCTGTAG GTC TGGGTAT GCTGTTCCTG 9 00 
GGTGCTGCAG CTACCGCTCT GAC CGT AC AG 9 60 

CAGCAGGACA ACCTGCTGCG TGCAATCCAG 1020 

TGGGGTATCC GTCAGCTGCG TGCTCGTCTG 10 80 

CAGCTGCTGA AC C TGTGGGG CTGCAAAGGT 1140 

AACGAAACCT GGCGTAACAC CACCAACATC 12 0 0 

G AATGGG AC C AGCAGATCGA CAACGTTTCT 12 60 

CAGGTTCAGC AGGAACAGAA CGAAAAAAAA 1320 

TGGAACTGGC TGGACATCAC CAAATGGCTG 13 80 

TCCCTGCAGA TCCCGACCCG TCAGCAGTCT 1440 

TAATAG 1476 



(2) INFORMATION FOR SEQ ID NO: 50: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 



35 


Met 


Ser 


Phe 


Val 


Val 


He 


He 


Pro 


Ala 


Arg 


Tyr 


Ala 


Ser 


Thr 


Arg 


Leu 


1 








5 










10 










15 






Pro 


Gly 


Lys 


Pro 


Leu 


Val 


Asp 


lie 


Asn 


Gly 


Lys 


Pro 


Met 


He 


Val 


His 






20 










25 










30 








Val 


Leu 


Glu 


Arg 


Ala 


Arg 


Glu 


Ser 


Gly 


Ala 


Glu 


Arg 


He 


He 


Val 


Ala 


40 






35 










40 










45 






Glu 


Thr 


Asp 
50 


His 


Glu 


Asp 


Val 


Ala 
55 


Arg 


Ala 


Val 


Glu 


Ala 
60 


Ala 


Gly 


Gly 




Val 


Cys 


Met 


Thr 


Arg 


Ala 


Asp 


His 


Gin 


Ser 


Gly 


Thr 


Glu 


Arg 


Leu 


Ala 




65 








70 










75 








Val 
95 


80 


45 


Glu 


Val 


Val 


Glu 


Lys 
85 


Cys 


Ala 


Phe 


Ser 


Asp 
90 


Asp 


Thr 


Val 


He 


Asn 




Val 


Gin 


Gly 


Asp 


Glu 


Pro 


Met 


He 


Pro 


Ala 


Thr 


He 


He 


Arg 


Gin 


Val 








100 










105 










110 








Ala 


Asp 


Asn 


Leu 


Ala 


Gin 


Arg 


Gin 


Val 


Gly Met 


Thr 


Thr 


Leu 


Ala 


Val 


50 




115 










120 










125 






Val 




Pro 


He 


His 


Asn 


Ala 


Glu 


Glu 


Ala 


Phe 


Asn 


Pro 


Asn 


Ala 


Val 


Lys 






130 










135 










140 








He 




Val 


Leu 


Asp 


Ala 


Glu 


Gly 


Tyr 


Ala 


Leu 


Tyr 


Phe 


Ser 


Arg 


Ala 


Thr 




145 








150 










155 










160 


55 


Pro 


Trp 


Asp 


Arg 


Asp 


Arg 


Phe 


Ala 


Glu 


Gly 


Leu 


Glu 


Thr 


Val 


Gly 


Asp 








165 










170 










175 






Asn 


Phe 


Leu 


Arg 
180 


His 


Leu 


Gly 


He 


Tyr 
185 


Gly 


Tyr 


Arg 


Ala 


Gly 
190 


Phe 


He 




Arg 


Arg 


Tyr 


Val 


Asn 


Trp 


Gin 


Pro 


Ser 


Pro 


Leu 


Glu 


His 


He 


Glu 


Met 


60 




195 










200 










205 








Leu 


Glu 
210 


Gin 


Leu 


Arg 


Val 


Leu 
215 


Trp 


Tyr 


Gly 


Glu 


Lys 
220 


He 


His 


Val 


Ala 
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Val Ala Gin Glu Val Pro Gly Thr Gly Val Asp Thr Pro Glu Asp Leu 
225 230 235 240 

Asp Pro Ser Thr Asn Ser He Gly Gly Asp Met Lys Asp He Trp Arg 
245 250 255 

5 Asn Glu Leu Phe Lys Tyr Lys Val Val Arg Val Lys Pro Phe Ser Val 
260 265 270 

Ala Pro Thr Pro He Ala Arg Pro Val He Gly Thr Gly Thr His Arg 

275 280 285 

Glu Lys Arg Ala Val Gly Leu Gly Met Leu Phe Leu Gly Val Leu Ser 
10 290 295 300 

Ala Ala Gly Ser Thr Met Gly Ala Ala Ala Thr Ala Leu Thr Val Gin 
305 310 315 320 

Thr His Ser Val He Lys Gly He Val Gin Gin Gin Asp Asn Leu Leu 
325 330 335 

15 Arg Ala He Gin Ala Gin Gin Glu Leu Leu Arg Leu Ser Val Trp Gly 
340 345 350 

He Arg Gin Leu Arg Ala Arg Leu Leu Ala Leu Glu Thr Leu He Gin 

355 360 365 

Asn Gin Gin Leu Leu Asn Leu Trp Gly Cys Lys Gly Arg Leu He Cys 
20 370 375 380 

Tyr Thr Ser Val Lys Trp Asn Glu Thr Trp Arg Asn Thr Thr Asn He 
385 390 395 400 

Asn Gin He Trp Gly Asn Leu Thr Trp Gin Glu Trp Asp Gin Gin He 
405 410 415 

25 Asp Asn Val Ser Ser Thr He Tyr Glu Glu He Gin Lys Ala Gin Val 
420 425 430 

Gin Gin Glu Gin Asn Glu Lys Lys Leu Leu Glu Leu Asp Glu Trp Ala 

435 440 445 

Ser Leu Trp Asn Trp Leu Asp He Thr Lys Trp Leu Arg Asn He Arg 
30 450 455 460 

Gin Gly Tyr Gin Pro Leu Ser Leu Gin He Pro Thr Arg Gin Gin Ser 
465 470 475 480 

Glu Ala Glu Thr Pro Gly Arg Thr Gly Glu 
485 490 



35 



(2) INFORMATION FOR SEQ ID NO: 51; 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1125 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

ATGATCGGTG GTGACATGAA AGACATCTGG CGTAACGAAC TGTTCAAATA CAAAGTTGTT 60 

CGTGTTAAAC CGTTCTCTGT TGCTCCGACC CCGATCGCTC GTCCGGTTAT CGGTACTGGC 12 0 

ACCCACCGTG AAAAACGTGC TGTAGGTCTG GGTATGCTGT TCCTGGGCGT TCTGTCTGCA 180 

50 GCAGGTTCCA CTATGGGTGC TGCAGCTACC GCTCTGACCG TACAGACCCA CTCTGTTATC 240 

AAAGGTATCG TACAGCAGCA GGACAACCTG CTGCGTGCAA TCCAGGCACA GCAGGAACTG 30 0 

CTGCGTCTGT CTGTATGGGG TATCCGTCAG CTGCGTGCTC GTCTGCTGGC ACTGGAAACC 3 60 

CTGATCCAGA ACCAGCAGCT GCTGAACCTG TGGGGCTGCA AAGGTCGTCT GATCTGCTAC 42 0 

ACCTCCGTTA AATGGAACGA AACCTGGCGT AACACCACCA ACATCAACCA GATCTGGGGT 480 

55 AACCTGACCT GGCAGGAATG GGACCAGCAG ATCGACAACG TTTCTTCCAC CATCTACGAA 540 

GAAATCCAGA AAGCTCAGGT TCAGCAGGAA CAGAACGAAA AAAAACTGCT GGAACTGGAC 60 0 

GAATGGGCTT CTCTGTGGAA CTGGCTGGAC ATCACCAAAT GGCTGCGTAA CATCCGTCAG 660 

GGCTACCAGC CGCTGTCCCT GCAGATCCCG ACCCGTCAGC AGTCTGAAGC TGAAACTCCG 720 

GGTCGTACCG GTGAAGGTGG TGGTGACGAA GGCCGTCCGC GTCTGATCCC GTCTCCGCAG 7 80 

60 GGTTTCCTGC CGCTGCTGTA CACCGACCTG CGTACCATCA TCCTGTGGTC CTACCACCTG 840 

CTGTCTAACC TGATCTCTGG TACTCAGACT GTTATCTCTC ACCTGCGTCT GGGTC TGTGG 900 

ATTC TGGGTC AGAAAATCAT CGACGCTTGC CGTATCTGCG CTGCTGTTAT CCACTACTGG 960 
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CTGCAGGAAC TGCAGAAATC CGCTACCTCC CTGATCGACA CCTTCGCTGT TGCAGTTGCT 
AACTGGACTG ACGACATCAT CCTGGGTATC CAGCGTCTGG GTCGTGGTAT CCTGAACATC 
CCGCGTCGTG TTCGCCAGGG CTTCGAACGC TCTCTGCTGT AATAG 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

Met lie Gly Gly Asp Met Lys Asp lie Trp Arg Asn Glu Leu Phe Lys 

15 10 15 

Tyr Lys Val Val Arg Val Lys Pro Phe Ser Val Ala Pro Thr Pro lie 

20 25 30 

Ala Arg Pro Val lie Gly Thr Gly Thr His Arg Glu Lys Arg Ala Val 

35 40 45 

Gly Leu Gly Met Leu Phe Leu Gly Val Leu Ser Ala Ala Gly Ser Thr 

50 55 60 

Met Gly Ala Ala Ala Thr Ala Leu Thr Val Gin Thr His Ser Val lie 
65 70 75 80 

Lys Gly lie Val Gin Gin Gin Asp Asn Leu Leu Arg Ala lie Gin Ala 

85 90 95 

Gin Gin Glu Leu Leu Arg Leu Ser Val Trp Gly He Arg Gin Leu Arg 

100 105 HO 

Ala Arg Leu Leu Ala Leu Glu Thr Leu He Gin Asn Gin Gin Leu Leu 

115 120 125 

Asn Leu Trp Gly Cys Lys Gly Arg Leu He Cys Tyr Thr Ser Val Lys 

130 135 140 

Trp Asn Glu Thr Trp Arg Asn Thr Thr Asn He Asn Gin He Trp Gly 
145 150 155 160 

Asn Leu Thr Trp Gin Glu Trp Asp Gin Gin He Asp Asn Val Ser Ser 

165 170 175 

Thr He Tyr Glu Glu He Gin Lys Ala Gin Val Gin Gin Glu Gin Asn 

180 185 190 

Glu Lys Lys Leu Leu Glu Leu Asp Glu Trp Ala Ser Leu Trp Asn Trp 

195 200 205 

Leu Asp He Thr Lys Trp Leu Arg Asn He Arg Gin Gly Tyr Gin Pro 

210 215 220 

Leu Ser Leu Gin He Pro Thr Arg Gin Gin Ser Glu Ala Glu Thr Pro 
225 230 235 240 

Gly Arg Thr Gly Glu Gly Gly Gly Asp Glu Gly Arg Pro Arg Leu He 

245 250 255 

Pro Ser Pro Gin Gly Phe Leu Pro Leu Leu Tyr Thr Asp Leu Arg Thr 

260 265 270 

He He Leu Trp Ser Tyr His Leu Leu Ser Asn Leu He Ser Gly Thr 

275 280 285 

Gin Thr Val He Ser His Leu Arg Leu Gly Leu Trp He Leu Gly Gin 

290 295 300 

Lys He He Asp Ala Cys Arg He Cys Ala Ala Val He His Tyr Trp 
305 310 315 320 

Leu Gin Glu Leu Gin Lys Ser Ala Thr Ser Leu He Asp Thr Phe Ala 

325 330 335 

Val Ala Val Ala Asn Trp Thr Asp Asp He He Leu Gly He Gin Arg 

340 345 350 

Leu Gly Arg Gly He Leu Asn He Pro Arg Arg Val Arg Gin Gly Phe 
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15 



355 360 365 

Glu Arg Ser Leu Leu 
370 



(2) INFORMATION FOR SEQ ID NO: 53: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1860 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 



60 



ATGAGTTTTG TGGTCATTAT TCCCGCGCGC TACGCGTCGA CGCGTCTGCC CGGTAAACCA 

TTGGTTGATA TTAACGGCAA ACCCATGATT GTTCATGTTC TTGAACGCGC GCGTGAATCA 12 0 

GGTGC CGAGC GCATCATCGT GGCAACCGAT CATGAGGATG TTGCCCGCGC CGTTGAAGCC 180 

GCTGGCGGTG AAGTATGTAT GACGCGCGCC GATCATCAGT CAGGAACAGA ACGTCTGGCG 240 

20 GAAGTTGTCG AAAAATGCGC ATTCAGCGAC GACACGGTGA TCGTTAATGT GCAGGGTGAT 3 00 

GAAC CGATGA TCCCTGCGAC AATCATTCGT CAGGTTGCTG ATAACCTCGC TCAGCGTCAG 3 60 

GTGGGTATGA CGACTCTGGC GGTGC C AATC CACAATGCGG AAGAAGCGTT TAACCCGAAT 42 0 

GCGGTGAAAG TGGTTCTCGA CGCTGAAGGG TATGCACTGT ACTTCTCTCG CGCCACCATT 480 

CCTTGGGATC GTGATCGTTT TGCAGAAGGC CTTGAAACCG TTGGCGATAA CTTCCTGCGT 540 

25 CATCTTGGTA TTTATGGCTA CCGTGCAGGC TTTATCCGTC GTTACGTCAA CTGGCAGCCA 600 

AGTCCGTTAG AACACATCGA AATGTTAGAG CAGCTTCGTG TTC TGTGGT A CGGCGAAAAA 66 0 

ATCCATGTTG CTGTTGCTCA GGAAGTTCCT GGCACAGGTG TGGATACCCC TGAAGATCTC 72 0 

GACCCGTCGA CGAATTCTAT CGGTGGTGAC ATGAAAGACA TCTGGCGTAA CGAACTGTTC 780 

AAATACAAAG TTGTTCGTGT TAAACCGTTC TCTGTTGCTC CGACCCCGAT CGCTCGTCCG 840 

30 GTTATCGGTA CTGGCACCCA CCGTGAAAAA CGTGCTGTAG GTCTGGGTAT GCTGTTCCTG 900 

GGCGTTCTGT CTGCAGCAGG TTCCACTATG GGTGCTGCAG CTACCGCTCT GAC CGTAC AG 9 60 

ACCCACTCTG TTATCAAAGG TATCGTACAG CAGCAGGACA ACCTGCTGCG TGCAATCCAG 1020 

GCACAGCAGG AACTGCTGCG TCTGTCTGTA TGGGGTATCC GTCAGCTGCG TGCTCGTCTG 1080 

CTGGCACTGG AAACCCTGAT CCAGAACCAG CAGCTGCTGA ACCTGTGGGG CTGCAAAGGT 1140 

35 CGTCTGATCT GCTACACCTC CGTTAAATGG AACGAAACCT GGCGTAACAC CACCAACATC 12 0 0 

AACCAGATCT GGGGTAACCT GACCTGGCAG G AATGGG AC C AGCAGATCGA CAACGTTTCT 1260 

TCCACCATCT ACGAAGAAAT CCAGAAAGCT CAGGTTCAGC AGGAACAGAA CGAAAAAAAA 13 2 0 

CTGCTGGAAC TGGACGAATG GGCTTCTCTG TGGAACTGGC TGGACATCAC CAAATGGCTG 13 80 

CGTAACATCC GTCAGGGCTA CCAGCCGCTG TCCCTGCAGA TCCCGACCCG TCAGCAGTCT 1440 

40 G AAGC TG AAA CTCCGGGTCG TACCGGTGAA GGTGGTGGTG ACGAAGGCCG TCCGCGTCTG 15 0 0 

ATCCCGTCTC CGCAGGGTTT CCTGCCGCTG CTGTACACCG ACCTGCGTAC CATCATCCTG 1560 

TGGTCCTACC ACCTGCTGTC TAAC CTGATC TCTGGTACTC AG AC TGTT AT CTCTCACCTG 1620 

CGTCTGGGTC TGTGGATTCT GGGTCAGAAA ATCATCGACG CTTGC CGTAT CTGCGCTGCT 168 0 

GTTATCCACT ACTGGCTGCA GGAAC TGC AG AAATCCGCTA CCTCCCTGAT CGACACCTTC 1740 

45 GCTGTTGCAG TTGCTAACTG GACTGACGAC ATCATCCTGG GTATCCAGCG TCTGGGTCGT 180 0 

GGTATCCTGA ACATCCCGCG TCGTGTTCGC CAGGGCTTCG AACGCTCTCT GCTGTAATAG 1860 



(2) INFORMATION FOR SEQ ID NO: 54: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 618 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

60 

Met Ser Phe Val Val He He Pro Ala Arg Tyr Ala Ser Thr Arg Leu 
15 10 15 



-73- 



Pro Gly Lys Pro 
20 

Val Leu Glu Arg 
35 

Thr Asp His Glu 
50 

Val Cys Met Thr 

65 

Glu Val Val Glu 

Val Gin Gly Asp 
100 

Ala Asp Asn Leu 
115 

Pro lie His Asn 
130 

Val Leu Asp Ala 
145 

Pro Trp Asp Arg 

Asn Phe Leu Arg 
180 

Arg Arg Tyr Val 
195 

Leu Glu Gin Leu 
210 

Val Ala Gin Glu 
225 

Asp Pro Ser Thr 

Asn Glu Leu Phe 
260 

Ala Pro Thr Pro 
275 

Glu Lys Arg Ala 
290 

Ala Ala Gly Ser 
305 

Thr His Ser Val 

Arg Ala lie Gin 
340 

lie Arg Gin Leu 
355 

Asn Gin Gin Leu 

370 

Tyr Thr Ser Val 
385 

Asn Gin lie Trp 

Asp Asn Val Ser 
420 

Gin Gin Glu Gin 
435 

Ser Leu Trp Asn 
450 

Gin Gly Tyr Gin 
465 

Glu Ala Glu Thr 

Arg Pro Arg Leu 
500 



Leu Val Asp He 

Ala Arg Glu Ser 
40 

Asp Val Ala Arg 
55 

Arg Ala Asp His 
70 

Lys Cys Ala Phe 
85 

Glu Pro Met He 

Ala Gin Arg Gin 
120 

Ala Glu Glu Ala 
135 

Glu Gly Tyr Ala 
150 

Asp Arg Phe Ala 
165 

His Leu Gly He 

Asn Trp Gin Pro 
200 

Arg Val Leu Trp 
215 

Val Pro Gly Thr 
230 

Asn Ser He Gly 
245 

Lys Tyr Lys Val 

He Ala Arg Pro 
280 

Val Gly Leu Gly 
295 

Thr Met Gly Ala 
310 

He Lys Gly He 
325 

Ala Gin Gin Glu 

Arg Ala Arg Leu 

360 

Leu Asn Leu Trp 
375 

Lys Trp Asn Glu 
390 

Gly Asn Leu Thr 
405 

Ser Thr He Tyr 

Asn Glu Lys Lys 
440 

Trp Leu Asp He 
455 

Pro Leu Ser Leu 
470 

Pro Gly Arg Thr 
485 

He Pro Ser Pro 



Asn Gly Lys Pro 
25 

Gly Ala Glu Arg 

Ala Val Glu Ala 
60 

Gin Ser Gly Thr 
75 

Ser Asp Asp Thr 
90 

Pro Ala Thr He 
105 

Val Gly Met Thr 

Phe Asn Pro Asn 
140 

Leu Tyr Phe Ser 
155 

Glu Gly Leu Glu 
170 

Tyr Gly Tyr Arg 
185 

Ser Pro Leu Glu 

Tyr Gly Glu Lys 
220 

Gly Val Asp Thr 
235 

Gly Asp Met Lys 
250 

Val Arg Val Lys 
265 

Val He Gly Thr 

Met Leu Phe Leu 
300 

Ala Ala Thr Ala 
315 

Val Gin Gin Gin 
330 

Leu Leu Arg Leu 
345 

Leu Ala Leu Glu 

Gly Cys Lys Gly 
380 

Thr Trp Arg Asn 
395 

Trp Gin Glu Trp 
410 

Glu Glu He Gin 
425 

Leu Leu Glu Leu 

Thr Lys Trp Leu 
460 

Gin He Pro Thr 

• 475 
Gly Glu Gly Gly 
490 

Gin Gly Phe Leu 
505 



Met He Val His 
30 

He He Val Ala 
45 

Ala Gly Gly Glu 

Glu Arg Leu Ala 
80 

Val He Val Xsn 
95 

He Arg Gin Val 
110 

Thr Leu Ala Val 
125 

Ala Val Lys Val 

Arg Ala Thr He 
160 

Thr Val Gly Asp 
175 

Ala Gly Phe He 
190 

His He Glu Met 
205 

He His Val Ala 

Pro Glu Asp Leu 
240 

Asp lie Trp Arg 
255 

Pro Phe Ser Val 
270 

Gly Thr His Arg 
285 

Gly Val Leu Ser 

Leu Thr Val Gin 

320 

Asp Asn Leu Leu 

335 

Ser Val Trp Gly 
350 

Thr Leu He Gin 
365 

Arg Leu He Cys 

Thr Thr Asn He 
400 

Asp Gin Gin He 
415 

Lys Ala Gin Val 
430 

Asp Glu Trp Ala 
445 

Arg Asn He Arg 

Arg Gin Gin Ser 
480 

Gly Asp Glu Gly 
495 

Pro Leu Leu Tyr 
510 
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Thr 


Asp 


Leu 


Arg 


Thr 


He 


He 


Leu 


Trp 


Ser 


Tyr 


His 


Leu 


Leu 


Ser 


Asn 




515 








520 










525 








Leu 


He 


Ser 


Gly 


Thr 


Gin 


Thr 


Val 


He 


Ser 


His 


Leu 


Arg 


Leu 


Gly 


Leu 




530 








535 










540 










Trp 


He 


Leu 


Gly 


Gin 


Lys 


He 


He 


Asp 


Ala 


Cys 


Arg 


He 


Cys 


Ala 


Ala 


545 










550 










555 










C C C\ 

b b U 


Val 


He 


His 


Tyr 


Trp 


Leu 


Gin 


Glu 


Leu 


Gin 


Lys 


Ser 


Ala 


Thr 


Ser 


Leu 








565 










570 










575 




He 


Asp 


Thr 


Phe 


Ala 


Val 


Ala 


Val 


Ala 


Asn 


Trp 


Thr 


Asp 


Asp 


lie 


Tl 

lie 






580 










585 










590 






Leu 


Gly 


He 


Gin 


Arg 


Leu 


Gly 


Arg 


Gly 


He 


Leu 


Asn 


He 


Pro 


Arg 


Arg 




595 










600 










605 








Val 


Arg 
610 


Gin 


Gly 


Phe 


Glu 


Arg 
615 


Ser 


Leu 


Leu 















(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 4 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Met Ser Phe Val Val He He Pro Ala Arg Tyr Ala Ser Thr Arg Leu 

15 10 15 

Pro Gly Lys Pro Leu Val Asp He Asn Gly Lys Pro Met He Val His 
30 20 25 30 

Val Leu Glu Arg Ala Arg Glu Ser Gly Ala Glu Arg He He Val Ala 

35 40 45 

Thr Asp His Glu Asp Val Ala Arg Ala Val Glu Ala Ala Gly Gly Glu 
50 55 60 

35 Val Cys Met Thr Arg Ala Asp His Gin Ser Gly Thr Glu Arg Leu Ala 
65 70 75 80 

Glu Val Val Glu Lys Cys Ala Phe Ser Asp Asp Thr Val He Val Asn 

85 90 95 

Val Gin Gly Asp Glu Pro Met He Pro Ala Thr He He Arg Gin Val 
40 100 105 HO 

Ala Asp Asn Leu Ala Gin Arg Gin Val Gly Met Thr Thr Leu Ala Val 

115 120 125 

Pro He His Asn Ala Glu Glu Ala Phe Asn Pro Asn Ala Val Lys Val 
130 135 140 

45 Val Leu Asp Ala Glu Gly Tyr Ala Leu Tyr Phe Ser Arg Ala Thr He 
145 150 155 160 

Pro Trp Asp Arg Asp Arg Phe Ala Glu Gly Leu Glu Thr Val Gly Asp 

165 170 175 

Asn Phe Leu Arg His Leu Gly He Tyr Gly Tyr Arg Ala Gly Phe He 
50 180 185 190 

Arg Arg Tyr Val Asn Trp Gin Pro Ser Pro Leu Glu His He Glu Met 

195 200 205 

Leu Glu Gin Leu Arg Val Leu Trp Tyr Gly Glu Lys He His Val Ala 
210 215 220 

55 Val Ala Gin Glu Val Pro Gly Thr Gly Val Asp Thr Pro Glu Asp Leu 
225 230 . 235 240 

Asp Pro Ser Thr Asn Ser Met Glu Gly Glu Leu Thr Cys Asn Ser Thr 

245 250 255 

Val Thr Ser He He Ala Asn He Asp Ser Asp Gly Asn Gin Thr Asn 
60 260 265 270 

He Thr Phe Ser Ala Glu Val Ala Glu Leu Tyr Arg Leu Glu Leu Gly 
275 280 285 
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Asp 


Tyr 


Lys 


Leu 


Tip 

lie 


Glu 


Val 


Thr 


Pro 


He 


Gly 


Phe 


Ala 


Pro 


Thr 


Lys 


O Q n 

zy u 








295 










300 










Glu 


Lys 


Arg 


Tyr 




Ser 


Ala 


Pro 


Val 


Arg 


Asn 


Lys 


Arg 


Gly 


Val 


Phe 


305 




310 










315 










320 


Val 


Leu 


tjj.y 


rue 


Leu 


Gly 


Phe 


Leu 


Ala 


Thr 


Ala 


Gly 


Ser 


Ala 


Met 


Gly 








325 










330 










335 




Ala 


Ala 


Ser 


Leu 


Thr 


Leu 


Ser 


Ala 


Gin 


Ser 


Arg 


Thr 


Leu 


Leu 


Ala 


Gly 






.3 4fc U 










345 










350 






lie 


Val 


Gin 
355 


Lrin 


Qjr±n 


bin 


Cl -n 


Leu 
3 60 


Leu 


Asp 


Val 


Val 


Lys 
365 


Arg 


Gin 


Gin 


Glu 


Met 
370 


Leu 


Arg 


Leu 


Thr 


Val 

375 


irp 


uiy 


Thr 


Lys 


Asn 
380 


Leu 


Gin 


Ala 


Arg 


Val 


Thr 


Ala 


lie 


(jrlU 


Lys 


Tyr 


Leu 


T AT" CI 


Asp 


Gin 


Ala 


Gin 


Leu 


Asn 


Ser 


-Job 










390 










395 










400 


Trp 


Gly 


Cys 


Aia 


rue 


Arg 


bill 


Val 


Cys 


His 


Thr 


Thr 


Val 


Pro 


Trp 


Val 




405 










410 










415 




Asn 




Ser 


Leu 


Thr 


Pro 


Asp 


Trp 


Asn 


Asn 


Met 


Thr 


Trp 


Gin 


Glu 


Trp 






420 










425 










430 






Glu 


Lys 


Arg 


Val 


His 


Tyr 


Leu 


Glu 


Ala 


Asn 


He 


Ser 


Gin 


Ser 


Leu 


Glu 




435 










440 










445 








Gin 


Ala 
450 


Gin 


He 


Gin 


Gin 


Glu 
455 


Lys 


Asn 


Met 


Tyr 


Glu 
460 


Leu 


Gin 


Lys 


Leu 



Asn Ser 
465 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 



Met 


Ser 


Phe 


Val 


Val 


He 


He 


Pro 


Ala 


Arg 


Tyr 


Ala 


Ser 


Thr 


Arg 


Leu 


1 






5 










10 










15 




Pro 


Gly 


Lys 


Pro 


Leu 


Val 


Asp 


He 


Asn 


Gly 


Lys 


Pro 


Met 


He 


Val 


His 




20 










25 










30 






Val 


Leu 


Glu 


Arg 


Ala 


Arg 


Glu 


Ser 


Gly 


Ala 


Glu 


Arg 


He 


He 


Val 


Ala 






35 








40 










45 






Glu 


Thr 


Asp 
50 


His 


Glu 


Asp 


Val 


Ala 
55 


Arg 


Ala 


Val 


Glu 


Ala 
60 


Ala 


Gly 


Gly 


Val 


Cys 


Met 


Thr 


Arg 


Ala 


Asp 


His 


Gin 


Ser 


Gly 


Thr 


Glu 


Arg 


Leu 


Ala 


65 








70 










75 










80 


Glu 


Val 


Val 


Glu 


Lys 
85 


Cys 


Ala 


Phe 


Ser 


Asp 
90 


Asp 


Thr 


Val 


He 


Val 
95 


Asn 


Val 


Gin 


Gly 


Asp 


Glu 


Pro 


Met 


He 


Pro 


Ala 


Thr 


He 


He 


Arg 


Gin 


Val 






100 










105 










110 




Val 


Ala 


Asp 


Asn 


Leu 


Ala 


Gin 


Arg 


Gin 


Val 


Gly 


Met 


Ala 


Thr 


Leu 


Ala 




115 










120 










125 








Pro 


He 


His 


Asn 


Ala 


Glu 


Glu 


Ala 


Phe 


Asn 


Pro 


Asn 


Ala 


Val 


Lys 


Val 




130 










135 










140 








He 


Val 


Leu 


Asp 


Ala 


Glu 


Gly 


Tyr 


Ala 


Leu 


Tyr 


Phe 


Ser 


Arg 


Ala 


Thr 


145 








150 










155 










160 


Pro 


Trp 


Asp 


Arg 


Asp 


Arg 


Phe 


Ala 


Glu 


Gly 


Leu 


Glu 


Thr 


Val 


Gly 


Asp 






165 










170 










175 




Asn 


Phe 


Leu 


Arg 
180 


His 


Leu 


Gly 


He 


Tyr 
185 


Gly 


Tyr 


Arg 


Ala 


Gly 
190 


Phe 


He 


Arg 


Arg 


Tyr 


Val 


Asn 


Trp 


Gin 


Pro 


Ser 


Pro 


Leu 


Glu 


His 


He 


Glu 


Met 


195 










200 










205 
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Leu Glu Gin Leu Arg Val Leu Trp Tyr Gly Glu Lys lie His Val Ala 

210 215 220 

Val Ala Gin Glu Val Pro Gly Thr Gly Val Asp Thr Pro Glu Asp Pro 
225 230 235 240 

5 Ser Thr Ala Leu Met Lys He Pro Gly Asp Pro Gly Gly Gly Asp Met 

245 250 255 

Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys He 

260 265 270 

Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gin 
10 275 280 285 

Arg Glu Lys Arg Ala Val Gly He Gly Ala Leu Phe Leu Gly Phe Leu 

290 295 300 

Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val 
305 310 315 320 

15 Gin Ala Arg Gin Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu 

325 330 335 

Leu Arg Ala He Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp 

340 345 350 

Gly He Lys Gin Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu 
20 355 360 365 

Lys Asp Gin Gin Leu Leu Gly He Trp Gly Cys Ser Gly Lys Leu lie 

370 375 380 

Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 
385 390 395 400 

25 Glu Gin He Trp Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu He 

405 410 415 

Asn Asn Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn 

420 425 430 

Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Val 
30 435 440 445 

Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr His Leu 

450 455 460 

Pro He Pro Arg Gly Pro Asp Arg Pro Glu Gly He Glu Lys Lys Ala 
465 470 475 480 

35 Ala Asn Val Thr Val Thr Val Pro Phe Val Trp 

485 490 



40 



(2) INFORMATION FOR SEQ ID NO: 57: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 651 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
45 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 57 : 

ATGATCGGTG GTGACATGAA AGACATCTGG CGTAACGAAC TGTTCAAATA CAAAGTTGTT 60 

50 CGTGTTAAAC CGTTCTCTGT TGCTCCGACC CCGATCGCTC GTCCGGTTAT CGGTACTGGC 12 0 

ACCCACCGTG AAAAACGTGC TGTAGGTCTG GGTATGCTGT TCCTGGGCGT TCTGTCTGCA 180 

GCAGGTTCCA CTATGGGTGC TGCAGCTACC GCTCTGACCG TACAGACCCA CTCTGTTATC 240 

AAAGGTATCG TACAGCAGCA GGACAACCTG CTGCGTGCAA TCCAGGCACA GCAGGAACTG 3 00 

C TGCGTC TGT CTGTATGGGG TATCCGTCAG CTGCGTGCTC GTCTGCTGGC AC TGGAAAC C 3 60 

55 CTGATCCAGA ACCAGCAGCT GCTGAACCTG TGGGGCTGCA AAGGTCGTCT GATCTGCTAC 42 0 

ACCTCCGTTA AATGGAACGA AACCTGGCGT AACACCACCA ACATCAACCA GATCTGGGGT 480 

AACCTGACCT GGCAGGAATG GGACCAGCAG ATCGACAACG TTTCTTCCAC CATCTACGAA 540 

GAAATCCAGA AAGCTCAGGT TCAGCAGGAA CAGAACGAAA AAAAACTGCT GGAACTGGAC 60 0 

GAATGGGCTT CTCTGTGGAA CTGGCTGGAC ATCACCAAAT GGC TGTAAT A G 651 

(2) INFORMATION FOR SEQ ID NO: 58: 



60 
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6165.US.01 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 

5 (C) STRANDEDNESS: single 

( D ) TO POLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

10 Met He Gly Gly Asp Met Lys Asp He Trp Arg Asn Glu Leu Phe Lys 
15 10 15 

Tvr Lys Val Val Arg Val Lys Pro Phe Ser Val Ala Pro Thr Pro He 

20 25 30 

Ala Arg Pro Val He Gly Thr Gly Thr His Arg Glu Lys Arg Ala Val 
15 35 40 45 

Gly Leu Gly Met Leu Phe Leu Gly Val Leu Ser Ala Ala Gly Ser Thr 

50 55 60 

Met Gly Ala Ala Ala Thr Ala Leu Thr Val Gin Thr His Ser Val He 
65 70 75 80 

20 Lys Gly He Val Gin Gin Gin Asp Asn Leu Leu Arg Ala He Gin Ala 

85 90 95 

Gin Gin Glu Leu Leu Arg Leu Ser Val Trp Gly He Arg Gin Leu Arg 

100 105 HO 

Ala Arg Leu Leu Ala Leu Glu Thr Leu He Gin Asn Gin Gin Leu Leu 
25 H5 120 125 

Asn Leu Trp Gly Cys Lys Gly Arg Leu He Cys Tyr Thr Ser Val Lys 

130 135 140 

Trp Asn Glu Thr Trp Arg Asn Thr Thr Asn He Asn Gin He Trp Gly 
145 150 155 160 

30 Asn Leu Thr Trp Gin Glu Trp Asp Gin Gin He Asp Asn Val Ser Ser 

165 170 175 

Thr He Tyr Glu Glu He Gin Lys Ala Gin Val Gin Gin Glu Gin Asn 

180 185 190 

Glu Lys Lys Leu Leu Glu Leu Asp Glu Trp Ala Ser Leu Trp Asn Trp 
35 195 200 205 

Leu Asp He Thr Lys Trp Leu 



40 (2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1386 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 

50 ATGAGTTTTG TGGTCATTAT TCCCGCGCGC TACGCGTCGA CGCGTCTGCC CGGTAAACCA 6 0 

TTGGTTGATA TTAACGGCAA ACCCATGATT GTTCATGTTC TTGAACGCGC GCGTGAATCA 12 0 

GGTGCCGAGC GCATCATCGT GGCAACCGAT CATGAGGATG TTGCCCGCGC CGTTGAAGCC 180 

GCTGGCGGTG AAGTATGTAT GACGCGCGCC GATCATCAGT CAGGAACAGA ACGTCTGGCG 240 

GAAGTTGTCG AAAAATGCGC ATTCAGCGAC GACACGGTGA TCGTTAATGT GCAGGGTGAT 3 00 

55 GAAC CGATGA TCCCTGCGAC AATCATTCGT CAGGTTGCTG ATAACCTCGC TCAGCGTCAG 3 60 

GTGGGTATGA CGACTCTGGC GGTGCCAATC CACAATGCGG AAGAAGCGTT TAACCCGAAT 420 

GCGGTGAAAG TGGTTCTCGA CGCTGAAGGG TATGC AC TGT ACTTCTCTCG CGCCACCATT 48 0 

CCTTGGGATC GTGATCGTTT TGCAGAAGGC CTTGAAACCG TTGGCGATAA CTTCCTGCGT 54 0 

CATCTTGGTA TTTATGGCTA CCGTGCAGGC TTTATCCGTC GTTACGTCAA CTGGCAGCCA 60 0 

60 AGTC CGTT AG AACACATCGA AATGTTAGAG CAGCTTCGTG TTCTGTGGTA CGGCGAAAAA 6 60 

ATC C ATGTTG CTGTTGCTCA GGAAGTTCCT GGCACAGGTG TGGATACCCC TGAAGATCTC 72 0 

GACCCGTCGA CGAATTCTAT CGGTGGTGAC ATGAAAGACA TCTGGCGTAA CGAACTGTTC 7 80 
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AAATACAAAG TTGTTCGTGT TAAACCGTTC TCTGTTGCTC CGACCCCGAT CGCTCGTCCG 840 

GTTATCGGTA CTGGCACCCA CCGTGAAAAA CGTGCTGTAG GTCTGGGTAT GCTGTTCCTG 900 

GGCGTTCTGT CTGCAGCAGG TTCCACTATG GGTGCTGCAG CTACCGCTCT GACCGTACAG 960 

ACCCACTCTG TTATCAAAGG TATCGTACAG CAGCAGGACA ACCTGCTGCG TGCAATCCAG 1020 

5 GCACAGCAGG AACTGCTGCG TCTGTCTGTA TGGGGTATCC GTCAGCTGCG TGCTCGTCTG 1080 

CTGGCACTGG AAAC CCTGAT CCAGAACCAG CAGCTGCTGA AC CTGTGGGG CTGCAAAGGT 1140 

CGTCTGATCT GCTACACCTC CGTTAAATGG AACGAAACCT GGCGTAACAC CACCAACATC 12 00 

AACCAGATCT GGGGTAACCT GACCTGGCAG GAATGGG AC C AGCAGATCGA CAACGTTTCT 12 60 

TCCACCATCT ACGAAGAAAT CCAGAAAGCT CAGGTTCAGC AGGAACAGAA CGAAAAAAAA 132 0 

10 CTGCTGGAAC TGGACGAATG GGCTTCTCTG TGGAACTGGC TGGACATCAC CAAATGGCTG 13 80 

TAATAG 1386 



15 



(2) INFORMATION FOR SEQ ID NO: 60: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 460 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingle 
20 (D) TOPOLOGY: linear 



I 25 



(ii) MOLECULE TYPE: Protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



Met Ser Phe Val Val He He Pro Ala Arg Tyr Ala Ser Thr Arg Leu 

1 S 10 15 

Pro Gly Lys Pro Leu Val Asp He Asn Gly Lys Pro Met lie Val His 
20 25 30 

30 Val Leu Glu Arg Ala Arg Glu Ser Gly Ala Glu Arg He He Val Ala 
35 40 45 

Thr Asp His Glu Asp Val Ala Arg Ala Val Glu Ala Ala Gly Gly Glu 

50 55 60 

Val Cys Met Thr Arg Ala Asp His Gin Ser Gly Thr Glu Arg Leu Ala 
35 65 70 75 80 

Glu Val Val Glu Lys Cys Ala Phe Ser Asp Asp Thr Val He Val Asn 

85 90 95 

Val Gin Gly Asp Glu Pro Met He Pro Ala Thr He He Arg Gin Val 
100 105 HO 

40 Ala Asp Asn Leu Ala Gin Arg Gin Val Gly Met Thr Thr Leu Ala Val 
115 120 125 

Pro He His Asn Ala Glu Glu Ala Phe Asn Pro Asn Ala Val Lys Val 

130 135 140 

Val Leu Asp Ala Glu Gly Tyr Ala Leu Tyr Phe Ser Arg Ala Thr He 
45 145 150 155 160 

Pro Trp Asp Arg Asp Arg Phe Ala Glu Gly Leu Glu Thr Val Gly Asp 

165 170 175 

Asn Phe Leu Arg His Leu Gly He Tyr Gly Tyr -Arg Ala Gly Phe He 
180 185 190 

50 Arg Arg Tyr Val Asn Trp Gin Pro Ser Pro Leu Glu His He Glu Met 
195 200 205 

Leu Glu Gin Leu Arg Val Leu Trp Tyr Gly Glu Lys He His Val Ala 

210 215 220 

Val Ala Gin Glu Val Pro Gly Thr Gly Val Asp Thr Pro Glu Asp Leu 
55 225 230 • 235 240 

Asp Pro Ser Thr Asn Ser He Gly Gly Asp Met Lys Asp He Trp Arg 

245 250 255 

Asn Glu Leu Phe Lys Tyr Lys Val Val Arg Val Lys Pro Phe Ser Val 
260 265 270 

60 Ala Pro Thr Pro He Ala Arg Pro Val He Gly Thr Gly Thr His Arg 
275 280 285 

Glu Lys Arg Ala Val Gly Leu Gly Met Leu Phe Leu Gly Val Leu Ser 
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290 295 300 



Ala 


Ala 


Gly 


Ser 


Thr 


Met 


Gly 


Ala 


Ala 


Ala 


Thr 


Til, 

Ala 


Leu 


1 X IX. 


Val 


Gin 


305 








310 










Jlo 












Thr 


His 


Ser 


Val 


He 


Lys 


Gly 


He 


Val 


Gin 


Gin 


Gin 


Asp 


Asn 


Leu 


Leu 










325 










33 0 










~i ^ c; 




Arg 


Ala 


He 


Gin 


Ala 


Gin 


Gin 


Glu 


Leu 


Leu 


Arg 


Leu 


Ser 


val 


Trp 


pi ,r 






340 










345 
















He 


Arg 


Gin 


Leu 


Arg 


Ala 


Arg 


Leu 


Leu 


Ala 


Leu 


Glu 


Thr 


Leu 


lie 


L?±n 




355 










360 










365 








Asn 


Gin 


Gin 


Leu 


Leu 


Asn 


Leu 


Trp 


Gly 


Cys 


Lys 


Gly 


Arg 


Leu 


x re 


Cys 




370 










375 










380 










Tyr 


Thr 


Ser 


Val 


Lys 


Trp 


Asn 


Glu 


Thr 


Trp 


Arg 


Asn 


Thr 


Thr 


Asn 


lie 


385 










390 










o r» cr 












Asn 


Gin 


He 


Trp 


Gly 


Asn 


Leu 


Thr 


Trp 




Glu 


Tttd 
■ LJ -k J 


Asp 


Gin 


Gin 


He 








405 










410 










415 




Asp 


Asn 


Val 


Ser 


Ser 


Thr 


He 


Tyr 


Glu 


Glu 


He 


Gin 


Lys 


Ala 


Gin 


Val 






420 










425 










430 






Gin 


Gin 


Glu 


Gin 


Asn 


Glu 


Lys 


Lys 


Leu 


Leu 


Glu 


Leu 


Asp 


Glu 


Trp 


Ala 






435 










440 










445 








Ser 


Leu 


Trp 


Asn 


Trp 


Leu 


Asp 


He 


Thr 


Lys 


Trp 


Leu 











450 455 460 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 873 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



Met 


He 


Val 


Thr 


Met 


Arg 


Ala 


Met 


Gly 


Lys 


Arg 


Asn 


Arg 


Lys 


Leu 


Gly 


1 








5 








10 










15 




He 


Leu 


Tyr 


He 


Val 


Met 


Ala 


Leu 


He 


He 


Pro 


Cys 


Leu 


Ser 


Ser 


Ser 






20 










25 










30 




Ala 


Gin 


Leu 


Tyr 


Ala 


Thr 


Val 


Tyr 


Ala 


Gly 


Val 


Pro 


Val 


Trp 


Glu 


Asp 






35 










40 










45 






Glu 


Ala 


Pro 


Val 


Leu 


Phe 


Cys 


Ala 


Ser 


Asp 


Ala 


Asn 


Leu 


Thr 


Ser 


Thr 




50 










55 










60 








Thr 


Lys 


His 


Asn 


Val 


Trp 


Ala 


Ser 


Gin 


Ala 


cys 


Val 


Pro 


Thr 


Asp 


Pro 


65 










70 










75 










80 


Pro 


His 


Glu 


Tyr 


Leu 


Leu 


Thr 


Asn 


Val 


Thr 


As P 


Asn 


Phe 


Asn 


He 


Trp 








85 










90 










95 




Glu 


Asn 


Tyr 


Met 


Val 


Glu 


Gin 


Met 


Gin 


Glu 


Asp 


He 


He 


Ser 


Leu 


Trp 






100 










105 










110 






Asp 


Gin 


Ser 


Leu 


Lys 


Pro 


Cys 


He 


Gin 


Met 


Thr 


Phe 


Met 


Cys 


He 


Gin 




115 










120 










125 








Met 


Asn 


Cys 


Thr 


Asp 


He 


Lys 


Asn 


Asn 


Asn 


Thr 


Ser 


Gly 


Thr 


Glu 


Asn 




130 








135 










140 








He 


Arg 


Thr 


Ser 


Ser 


Ser 


Glu 


Asn 


Pro 


Met 


Lys 


Thr 


Cys 


Glu 


Phe 


Asn 


145 










150 










155 










160 


Thr 


Thr 


Val 


Leu 


Lys 
165 


Asp 


Lys 


Lys 


Glu 


Lys 
170 


Lys 


Gin 


Ala 


Leu 


Phe 
175 


Tyr 


Val 


Ser 


Asp 


Leu 


Thr 


Lys 


Leu 


Ala 


Asp 


Asn 


Asn 


Thr 


Thr 


Asn 


Thr 


Met 






180 










185 










190 






Tyr 


Thr 


Leu 


He 


Asn 


Cys 


Asn 


Ser 


Thr 


Thr 


He 


Lys 


Gin 


Ala 


Cys 


Pro 




195 










200 










205 








Lys 


Val 


Ser 


Phe 


Glu 


Pro 


He 


Pro 


He 


Tyr 


Tyr 


Cys 


Ala 


Pro 


Ala 


Gly 
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210 215 220 

Tyr Ala He Phe Lys Cys Asn Ser Ala Glu Phe Asn Gly Thr Gly Lys 
225 230 235 240 

Cys Ser Asn He Ser Val Val Thr Cys Thr His Gly He Lys Pro Thr 
5 245 250 255 

Val Ser Thr Gin Leu He Leu Asn Gly Thr Leu Ser Lys Glu Lys He 

260 265 270 

Arg He Met Gly Lys Asn He Ser Asp Ser Gly Lys Asn He He Val 
275 280 285 

10 Thr Leu Ser Ser Asp He Glu He Thr Cys Val Arg Pro Gly Asn Asn 
290 295 300 

Gin Thr Val Gin Glu Met Lys He Gly Pro Met Ala Trp Tyr Ser Met 
305 310 315 320 

Ala Leu Gly Thr Gly Ser Asn Arg Ser Arg Val Ala Tyr Cys Gin Tyr 
15 325 330 335 

Asn Thr Thr Glu Trp Glu Lys Ala Leu Lys Asn Thr Ala Glu Arg Tyr 

340 345 350 

Leu Glu Leu He Asn Asn Thr Glu Gly Asn Thr Thr Met He Phe Asn 
355 360 365 

20 Arg Ser Gin Asp Gly Ser Asp Val Glu Val Thr His Leu His Phe Asn 
370 375 380 

Cvs His Gly Glu Phe Phe Tyr Cys Asn Thr Ser Glu Met Phe Asn Tyr 
385 390 395 400 

Thr Phe Leu Cys Asn Gly Thr Asn Cys Asn Asn Thr Gin Ser He Asn 
25 405 410 415 

Ser Ala Asn Gly Met He Pro Cys Lys Leu Lys Gin Val Val Arg Ser 

420 425 430 

Trp Met Arg Gly Gly Ser Gly Leu Tyr Ala Pro Pro He Pro Gly Asn 
435 440 445 

30 Leu Thr Cys He Ser His He Thr Gly Met He Leu Gin Met Asp Ala 
450 455 460 

Pro Trp Asn Lys Thr Glu Asn Thr Phe Arg Pro He Gly Gly Asp Met 
465 470 475 480 

Lys Asp He Trp Arg Asn Glu Leu Phe Lys Tyr Lys Val Val Arg Val 
35 485 490 495 

Lys Pro Phe Ser Val Ala Pro Thr Pro He Ala Arg Pro Val He Gly 

500 505 510 

Thr Gly Thr His Arg Glu Lys Arg Ala Val Gly Leu Gly Met Leu Phe 
515 520 525 

40 Leu Gly Val Leu Ser Ala Ala Gly Ser Thr Met Gly Ala Ala Ala Thr 
530 535 540 

Ala Leu Thr Val Gin Thr His Ser Val He Lys Gly He Val Gin Gin 
545 550 555 560 

Gin Asp Asn Leu Leu Arg Ala He Gin Ala Gin Gin Glu Leu Leu Arg 
45 565 570 575 

Leu Ser Val Trp Gly He Arg Gin Leu Arg Ala Arg Leu Leu Ala Leu 

580 585 590 

Glu Thr Leu He Gin Asn Gin Gin Leu Leu Asn Leu Trp Gly Cys Lys 
595 600 605 

50 Gly Arg Leu He Cys Tyr Thr Ser Val Lys Trp Asn Glu Thr Trp Arg 
610 615 620 

Asn Thr Thr Asn He Asn Gin He Trp Gly Asn Leu Thr Trp Gin Glu 
625 630 635 640 

Trp Asp Gin Gin He Asp Asn Val Ser Ser Thr He Tyr Glu Glu He 
55 645 650 655 

Gin Lys Ala Gin Val Gin Gin Glu Gin Asn Glu Lys Lys Leu Leu Glu 

660 665 670 

Leu Asp Glu Trp Ala Ser Leu Trp Asn Trp Leu Asp He Thr Lys Trp 
675 680 685 

60 Leu Trp Tyr He Lys He Ala He He He Val Gly Ala Leu He Gly 
690 695 700 

Val Arg He Val Met He Val Leu Asn Leu Val Arg Asn He Arg Gin 
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705 










71 n 

/ X v 










715 










720 


Gly 


Tyr 


Gin 


Pro 


Leu 


Ser 


Leu 


VJ ±11 


lie 


Pro 


Thr 


Arg 


Gin 


Gin 


Ser 


Glu 






(AO 










730 










735 




Ala 


Glu 


Thr 


Pro 


Gly 


Arg 


Tnr 






Gly 


Gly 


Gly 


Asp 


Glu 


Gly 


Arg 








740 








7 AR 










750 






Pro 


Arg 


Leu 


He 


Pro 


Ser 


Pro 


pin 


\j xy 


±r I itr 


Leu 


Pro 


Leu 


Leu 


Tvr 


Thr 




755 




















765 








Asp 


Leu 


Arg 


Thr 


He 


Tl - 

lie 


Leu 


Trp 


Ser 


iyr 


n x 0 


Leu 


Leu 


Ser 


Asn 


Leu 


77 0 








77 R 

I/O 










780 










lie 


Ser 


Gly 


Thr 


Gin 


Thr 


Val 




Ser 


His 


Leu 


Arg 


Leu 


Glv 


Leu 


Trp 


785 








7QH 










795 










800 


lie 


Leu 


Gly 


Gin 


Lys 


lie 


Tl A 

lie 


Asp 




Cy s 


Arg 


He 


Cys 


Ala 


Ala 


Val 








O V -> 










810 










815 




lie 


rllS 


Tyr 


irp 


Leu 


Gin 


Glu 


Leu 


Gin 




Ser 


Ala 


Thr 


Ser 


Leu 


He 






oon 










825 










830 






Asp 


Thr 


Jr ne 


i-iXcL 


v a x 


Ala 


Val 


Ala 


Asn 


Trp 


Thr 


Asp 


Asp 


He 


He 


Leu 




835 










840 










845 






Val 


Gly 


He 


Gin 


Arg 


Leu 


Gly 


Arg 


Gly 


He 


Leu 


Asn 


He 


Pro 


Arg 


Arg 


850 










855 










860 










Arg 


Gin 


Gly 


Phe 


Glu 


Arg 


Ser 


Leu 


Leu 
















865 










870 























(2) INFORMATION FOR SEQ ID NO : 62 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
YCTYTAGAGA GTGTCCCATT 



(2) INFORMATION FOR SEQ ID NO : 63 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 63 : 
GTGCTWCCTG CTGCACTTA 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
AAGTTGCTCA AGAGGTGGTA 
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(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
CCTTAGAGGC ACTTGAGGT 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
CCARAGCAGT AAGTAACGC 



(2) INFORMATION FOR SEQ ID NO : 67 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 67 : 
RTTAAYTAAT TGTAACTCCA CAA 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 
GAMTYTATGC ACCTCCCATC 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 
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GACATAACTA AATGGTTGTG G 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70 
ATACTTGARA GRTTAAGRAG AAT 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 
ATGCCATGTG TACAAGTAAC 



(2) INFORMATION FOR SEQ ID NO:72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 
ATACACTATT GTGCTCCARC 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
.(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

AGTTCTCCAT ATATCTTTCA TR 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 



AACATAACTG GAATGATYCT AC 



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CTGAGRTCCG TGTACAAC 



(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
ATTAGGCAGG GATATCAACC 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77 
CCTACTCCAG GTGCRCAT 



(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 
CAWCACAAGC CTGYGTTCC 



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
ATGTCTTCVT GCATTTGKTC 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 
AATGGGACAC TCTCTARAGR 



(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 
TTAACTGTCA TGGAGAATTC TT 



(2) INFORMATION FOR SEQ ID NO : 82 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 82 
AAGAATTCTC CATGACAGTT AA 



(2) INFORMATION FOR SEQ ID NO : 83 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 83 
TAAGTGCAGC AGGVJAGCAC 



(2) INFORMATION FOR SEQ ID NO: 84: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
CCACAACCAT TTAGTTATGT C 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
TACCACCTCT TGAGCAACTT 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
CYTGTCTAAT YCTYCTTGG 



(2) INFORMATION FOR SEQ ID NO : 87 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 
TGGCCTGGTA CAGCATGGG 



(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 
GTACGAATTC CATGGAAGGG GAGTTGACCT GC 
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10 



25 



(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: 
15 TATTGGATCC TTATCAGCTA TTTAGTTTTT GTAG 34 

(2) INFORMATION FOR SEQ ID NO: 90: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2214 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 0 : 



ATGAGTTTTG TGGTCATTAT TCCCGCGCGC TACGCGTCGA CGCGTCTGCC 50 

CGGTAAACCA TTGGTTGATA TTAACGGCAA ACCCATGATT GTTCATGTTC 10 0 

30 TTGAACGCGC GCGTGAATCA GGTGCCGAGC GCATCATCGT GGCAACCGAT 150 

CATGAGGATG TTGCCCGCGC CGTTGAAGCC GCTGGCGGTG AAGTATGTAT 200 

GACGCGCGCC GATCATCAGT CAGGAACAGA ACGTCTGGCG GAAGTTGTCG 25 0 

AAAAATGCGC ATTCAGCGAC GACACGGTGA TCGTTAATGT GCAGGGTGAT 3 00 

GAACCGATGA TCCCTGCGAC AATCATTCGT CAGGTTGCTG ATAACCTCGC 3 50 

35 TCAGCGTCAG GTGGGTATGG CG AC TCTGGC GGTGCCAATC CACAATGCGG 400 

AAGAAGCGTT TAACCCGAAT GCGGTGAAAG TGGTTCTCGA CGCTGAAGGG 450 

TATGCACTGT ACTTCTCTCG CGCCACCATT CCTTGGGATC GTGATCGTTT 50 0 

TGCAGAAGGC CTTGAAACCG TTGGCGATAA CTTCCTGCGT CATCTTGGTA 550 

TTTATGGCTA CCGTGCAGGC TTTATCCGTC GTTACGTCAA CTGGCAGCCA 60 0 

40 AGTCCGTTAG AACACATCGA AATGTTAGAG CAGCTTCGTG TTC TGTGGTA 650 

CGGCGAAAAA ATCCATGTTG CTGTTGCTCA GGAAGTTCCT GGCACAGGTG 70 0 

TGGATACCCC TGAAGATCCG TCGACAGCCC TTATGAAGAT CCCCGGCGAC 750 

C CGGGTGGTG GTGACATGCG TGACAACTGG CGTTCTGAAC TGTACAAATA 80 0 

CAAAGTTGTT AAAATCGAAC CGCTGGGTGT TGCTCCGACT AAAGCTAAAC 850 

45 GTCGTGTTGT TCAGCGTGAA AAACGCGCCG TTGGTATCGG TGCACTGTTC 900 

CTGGGTTTCC TGGGTGCTGC TGGTTCTACC ATGGGTGCTG CTTCTATGAC 950 

CCTGACTGTT CAGGCCCGTC AGCTTCTGTC TGGTATCGTT CAGCAGCAGA 100 0 

ACAATCTGCT GCGTGCTATC GAAGCTCAGC AGCATCTGCT GCAACTGACC 1050 

GTTTGGGGTA TCAAACAGCT TCAGGCTCGT ATCCTGGCTG TTGAACGTTA 1100 

50 CCTGAAAGAC CAGCAGCTGC TGGGTATCTG GGGTTGCTCT GGTAAAC TGA 1150 

TCTGCACTAC TGCTGTTCCG TGGAACGCTT CTTGGTCTAA CAAATCTCTG 1200 

GAACAGATCT GGAACAACAT GACTTGGATG GAATGGGACC GTGAAATCAA 1250 

CAACTACACA AGCTTGATCC ACTCTCTGAT CGAAGAAAGC CAGAACCAGC 13 00 

AGGAAAAAAA CGAACAGGAA CTTCTAGAAC TGGACAAATG GGTTAACCGT 13 50 

55 GTTCGTCAGG GTTACTCTCC GCTGTCTTTC CAGACCCATC TGCCGATCCC 1400 

GCGTGGTCCG GACCGTCCGG AAGGTATCGA AGAAGAAGGC GGCGAACGTG 1450 

ACCGTGACCG TTCCATTCGT CTGGTAATCG GTGGTGACAT GAAAGACATC 1500 

TGGCGTAACG AACTGTTCAA ATACAAAGTT GTTCGTGTTA AACCGTTCTC 155 0 

TGTTGCTCCG ACCCCGATCG CTCGTCCGGT TATCGGTACT GGCACCCACC 1600 

60 GTGAAAAACG TGCTGTAGGT CTGGGTATGC TGTTCCTGGG CGTTCTGTCT 1650 

GCAGCAGGTT CCACTATGGG TGCTGCAGCT ACCGCTCTGA CCGTACAGAC 17 00 

CCACTCTGTT ATCAAAGGTA TCGTACAGCA GCAGGACAAC CTGCTGCGTG 17 50 
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CAATCCAGGC ACAGCAGGAA CTGCTGCGTC TGTCTGTATG GGGTATCCGT 1800 

CAGCTGCGTG CTCGTCTGCT GGCACTGGAA ACCCTGATCC AGAACCAGCA 1850 

GCTGCTGAAC CTGTGGGGCT GCAAAGGTCG TCTGATCTGC TACACCTCCG 19 00 

TTAAATGGAA CGAAACCTGG CGTAACACCA CCAACATCAA CCAGATCTGG 19 50 

5 GGTAAC CTGA CCTGGCAGGA ATGGG AC C AG CAGATCGACA ACGTTTCTTC 20 0 0 

CACCATCTAC GAAGAAATCC AGAAAGC TC A GGTTCAGCAG GAACAGAACG 20 50 

AAAAAAAACT GCTGGAACTG GACGAATGGG CTTCTCTGTG GAACTGGCTG 2100 

GACATCACCA AATGGCTGCG TAACATCCGT CAGGGCTACC AGCCGCTGTC 2150 

CCTGCAGATC CCGACCCGTC AGCAGTCTGA AGCTGAAACT CCGGGTCGTA 22 0 0 

10 CCGGTGAATA ATAG 2214 



(2) INFORMATION FOR SEQ ID NO: 91: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



20 



40 



60 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: 



25 MSFWIIPAR YASTRLPGKP LVDINGKPMI VHVLERARES GAERIIVATD 50 

HEDVARAVEA AGGEVCMTRA DHQSGTERLA EWEKCAFSD DTVIVNVQGD 100 

EPMIPATIIR QVADNLAQRQ VGMATLAVPI HNAEEAFNPN AVKWLDAEG 150 

YALYFSRATI PWDRDRFAEG LETVGDNFLR HLG I YGYRAG FIRRYVNWQP 2 00 

SPLEHIEMLE QLRVLWYGEK IHVAVAQEVP GTGVDTPEDP STALMKIPGD 250 

30 PGGGDMRDNW RSELYKYKW KIEPLGVAPT KAKRRWQRE KRAVG I GALF 3 00 

LGFLGAAGST MGAASMTLTV QARQLLSGIV QQQNNLLRAI EAQQHLLQLT 3 50 

VWGIKQLQAR ILAVERYLKD QQLLGIWGCS GKLICTTAVP WNASWSNKSL 40 0 

EQIWNNMTWM EWDRE INNYT SLIHSLIEES QNQQEKNEQE LLELDKWVNR 450 

VRQGYSPLSF QTHLPIPRGP DRPEGIEEEG GERDRDRSIR L V I GGDMKD I 500 

35 WRNELFKYKV VRVKPFSVAP TPIARPVIGT GTHREKRAVG LGMLFLGVLS 550 

AAGS TMGAAA TALTVQTHSV IKGIVQQQDN LLRAIQAQQE LLRLSVWGIR 60 0 

QLRARLLALE TLIQNQQLLN LWGCKGRLIC YTSVKWNETW RNTTNINQ IW 650 

GNLTWQEWDQ QIDNVSSTIY EEIQKAQVQQ EQNEKKLLEL DEWASLWNWL 7 00 

DITKWLRNIR QGYQPLSLQI PTRQQSEAET PGRTGE 73 6 



(2) INFORMATION FOR SEQ ID NO: 92: 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 2124 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: 



55 



SEQ ID NO: 92 : 




TACGCGTCGA 


CGCGTCTGCC 


50 


ACCCATGATT 


GTTCATGTTC 


100 


GCATCATCGT 


GGCAACCGAT 


150 


GCTGGCGGTG 


AAGTATGTAT 


200 


ACGTCTGGCG 


GAAGTTGTCG 


250 


TCGTTAATGT 


GCAGGGTGAT 


300 


CAGGTTGCTG 


ATAACCTCGC 


350 


GGTGCCAATC 


CACAATGCGG 


400 


TGGTTCTCGA 


CGCTGAAGGG 


450 


CCTTGGGATC 


GTGATCGTTT 


500 


CTTCCTGCGT 


CATCTTGGTA 


550 
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TTTATGGCTA CCGTGCAGGC TTTATCCGTC GTTACGTCAA CTGGCAGCCA 600 

AGTCCGTTAG AACACATCGA AATGTTAGAG CAGCTTCGTG TTC TGTGGT A 650 

CGGCGAAAAA ATCCATGTTG CTGTTGCTCA GGAAGTTCCT GGCACAGGTG 7 00 

TGGATACCCC TGAAGATCCG TCGACAGCCC TTATGAAGAT CCCCGGCGAC 750 

5 CCGGGTGGTG GTGACATGCG TGACAACTGG CGTTCTGAAC TGTACAAATA 800 

CAAAGTTGTT AAAATCGAAC CGCTGGGTGT TGCTCCGACT AAAGCTAAAC 850 

GTCGTGTTGT TCAGCGTGAA AAACGCGCCG TTGGTATCGG TGCACTGTTC 9 00 

CTGGGTTTCC TGGGTGC TGC TGGTTC T AC C ATGGGTGCTG CTTCTATGAC 95 0 

CCTGACTGTT CAGGCCCGTC AGCTTCTGTC TGGTATCGTT CAGCAGCAGA 1000 

10 ACAATCTGCT GCGTGCTATC GAAGCTCAGC AGCATCTGCT GCAACTGACC 1050 

GTTTGGGGTA TCAAACAGCT TCAGGCTCGT ATCCTGGCTG TTGAACGTTA 110 0 

CCTGAAAGAC CAGCAGCTGC TGGGTATCTG GGGTTGCTCT GGTAAACTGA 115 0 

TCTGCACTAC TGCTGTTCCG TGGAACGCTT CTTGGTCTAA CAAATCTCTG 12 0 0 

GAACAGATCT GGAACAACAT GACTTGGATG G AATGGG AC C GTGAAATCAA 1250 

15 CAACTACACA AGCTTGATCC ACTCTCTGAT CGAAGAAAGC CAGAACCAGC 13 0 0 

AGGAAAAAAA CGAACAGGAA CTTCTAGAAC TGGACAAATG GGTTAACCGT 13 50 

GTTCGTCAGG GTTACTCTCC GCTGTCTTTC CAGACCCATC TGCCGATCCC 140 0 

GCGTGGTCCG GACCGTCCGG AAGGTATCGA AGAAGAAGGC GGCGAACGTG 145 0 

ACCGTGACCG TTCCATTCGT CTGGTAATCG GTGGTGACAT GAAAGACATC 150 0 

20 TGGCGTAACG AACTGTTCAA ATACAAAGTT GTTCGTGTTA AACCGTTCTC 1550 

TGTTGCTCCG ACCCCGATCG CTCGTCCGGT TATCGGTACT GGCACCCACC 160 0 

GTGAAAAACG TGCTGTAGGT CTGGGTATGC TGTTCCTGGG CGTTCTGTCT 1650 

GCAGCAGGTT CCACTATGGG TGCTGCAGCT ACCGCTCTGA CCGTACAGAC 17 0 0 

CCACTCTGTT ATCAAAGGTA TCGTACAGCA GCAGGACAAC CTGCTGCGTG 17 5 0 

25 CAATCCAGGC ACAGCAGGAA CTGCTGCGTC TGTCTGTATG GGGTATCCGT 180 0 

CAGCTGCGTG CTCGTCTGCT GGCACTGGAA ACCCTGATCC AGAACCAGCA 1850 

GCTGCTGAAC CTGTGGGGCT GCAAAGGTCG TCTGATCTGC TACACCTCCG 19 00 

TTAAATGGAA CGAAACCTGG CGTAACACCA CCAACATCAA CCAGATCTGG 1950 

GGTAACCTGA CCTGGCAGGA ATGGGACCAG CAGATCGACA ACGTTTCTTC 2000 

30 CACCATCTAC GAAGAAATCC AGAAAGC TC A GGTTCAGCAG GAACAGAACG 2050 

AAAAAAAACT GCTGGAACTG GACGAATGGG CTTCTCTGTG GAACTGGCTG 210 0 

GACATCACCA AATGGC TGT A AT AG 2124 



35 (2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 06 amino acids 

(B) TYPE: amino acid 

40 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 

MSFWIIPAR YASTRLPGKP LVDINGKPMI VHVLERARES GAERIIVATD 50 

HEDVARAVEA AGGEVCMTRA DHQSGTERLA EWEKCAFSD DTVIVNVQGD 10 0 

EPMIPATIIR QVADNLAQRQ VGMATLAVPI HNAEEAFNPN AVKWLDAEG 150 

50 YALYFSRATI PWDRDRFAEG LETVGDNFLR HLG I YGYRAG FIRRYVNWQP 200 

SPLEHIEMLE QLRVLWYGEK IHVAVAQEVP GTGVDTPEDP STALMKIPGD 250 

PGGGDMRDNW RSELYKYKW K I E PLGVAPT KAKRRWQRE KRAVGIGALF 3 00 

LGFLGAAGST MGAASMTLTV QARQLLSGIV QQQNNLLRAI EAQQHLLQLT 3 50 

VWGIKQLQAR ILAVERYLKD QQLLGIWGCS GKL I CTTAVP WNASWSNKSL 400 

55 EQIVJNNMTWM EWDRE INNYT SLIHSLIEES QNQQEKNEQE LLELDKWVNR 450 

VRQGYSPLSF QTHLPIPRGP DRPEGIEEEG GERDRDRSIR LVIGGDMKDI 500 

WRNELFKYKV VRVKPFSVAP TPIARPVIGT GTHREKRAVG LGMLFLGVLS 550 

AAGSTMGAAA TALTVQTHSV IKGIVQQQDN LLRAIQAQQE LLRLSVWGIR 60 0 

QLRARLLALE TLIQNQQLLN LWGCKGRLIC YTSVKWNETW RNTTNINQIW 650 

60 GNLTWQEWDQ QIDNVSSTIY EEIQKAQVQQ EQNEKKLLEL DEWASLWNWL 70 0 

DITKWL 706 
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(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH : 147 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

ATGATCGGTG GTGACATGAA AGACATCTGG CGTAACGAAC TGTTCAAATA 50 

CAAAGTTGTT CGTGTTAAAC CGTTCTCTGT TGCTCCGACC CCGATCGCTC 100 

GTC CGGTTAT CGGTACTGGC ACCCACCGTG AAAAACGTGC TGTAGGTCTG 150 

15 GGTATGCTGT TCCTGGGCGT TCTGTCTGCA GCAGGTTCCA CTATGGGTGC 2 00 

TGCAGCTACC GCTCTGACCG TACAGACCCA CTCTGTTATC AAAGGTATCG 250 

TACAGCAGCA GGACAACCTG CTGCGTGCAA TCCAGGCACA GCAGGAACTG 3 00 

CTGCGTCTGT CTGTATGGGG TATCCGTCAG CTGCGTGCTC GTCTGCTGGC 3 50 

ACTGGAAACC CTGATCCAGA ACCAGCAGCT GCTGAACCTG TGGGGC TGC A 40 0 

20 AAGGTCGTCT GATCTGCTAC ACCTCCGTTA AATGGAACGA AACCTGGCGT 450 

AACACCACCA ACATCAACCA GATCTGGGGT AACCTGACCT GGCAGGAATG 50 0 

GGACCAGCAG ATCGACAACG TTTCTTCCAC CATCTACGAA G AAATC C AG A 55 0 

AAGCTCAGGT TCAGCAGGAA CAGAACGAAA AAAAAC TGC T GGAACTGGAC 60 0 

GAATGGGCTT CTCTGTGGAA CTGGCTGGAC ATCACCAAAT GGC TGCGTAA 650 

25 CATCCGTCAG GGCTACCAGC CGCTGTCCCT GCAGATCCCG ACCCGTCAGC 700 

AGTCTGAAGC TGAAACTCCG GGTCGTACCG GTGAAGGTCC GGGTGGTGGT 7 50 

GACATGCGTG AC AAC TGGCG TTCTGAACTG TACAAATACA AAGTTGTTAA 30 0 

AATCGAACCG CTGGGTGTTG CTCCGACTAA AGC TAAACGT CGTGTTGTTC 850 

AGCGTGAAAA ACGCGCCGTT GGTATCGGTG CACTGTTCCT GGGTTTCCTG 900 

30 GGTGCTGCTG GTTCTACCAT GGGTGCTGCT TCTATGACCC TGACTGTTCA 950 

GGCCCGTCAG CTTCTGTCTG GTATCGTTCA GCAGCAGAAC AATCTGCTGC 1000 

GTGCTATCGA AGC TC AGC AG CATCTGCTGC AACTG AC CGT TTGGGGTATC 1050 

AAACAGCTTC AGGCTCGTAT CCTGGCTGTT G AACGTT AC C TGAAAGACCA 110 0 

GCAGCTGCTG GGTATCTGGG GTTGCTCTGG TAAACTGATC TGCACTACTG 1150 

35 CTGTTCCGTG GAACGCTTCT TGGTCTAACA AATCTCTGGA AC AGATC TGG 120 0 

AACAACATGA CTTGGATGGA ATGGGACCGT GAAATCAACA ACTACACAAG 125 0 

CTTGATCCAC TCTCTGATCG AAGAAAGC C A G AAC C AGC AG GAAAAAAACG 13 0 0 

AACAGGAACT TC TAG AAC TG GACAAATGGG TTAACCGTGT TCGTCAGGGT 13 50 

TACTCTCCGC TGTCTTTCCA GACCCATCTG CCGATCCCGC GTGGTC CGGA 14 0 0 

40 CCGTCCGGAA GGTATCGAAG AAGAAGGCGG CGAACGTGAC CGTGACCGTT 1450 

CCATTCGTCT GGTATAATAG 147 0 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 488 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 



MIGGDMKDIW 
GMLFLGVLSA 
LRLSVWGIRQ 
NTTNINQIWG 
60 EWASLWNWLD 
DMRDNWRSEL 
GAAGSTMGAA 



RNELFKYKW 
AGSTMGAAAT 
LRARLLALET 
NLTWQEWDQQ 
ITKWLRNIRQ 
YKYKWK I E P 
SMTLTVQARQ 



RVKPFSVAPT 
ALTVQTHSVI 
LIQNQQLLNL 
IDNVSSTIYE 
GYQPLSLQIP 
LGVAPTKAKR 
LLSGIVQQQN 



PIARPVIGTG 
KGIVQQQDNL 
WGCKGRLICY 
EIQKAQVQQE 
TRQQSEAETP 
RWQREKRAV 
NLLRAIEAQQ 



THREKRAVGL 
LRAIQAQQEL 
TSVKWNETWR 
QNEKKLLELD 
GRTGEGPGGG 
GIGALFLGFL 
HLLQLTVWGI 



50 
100 
150 
200 
250 
300 
350 
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KQLQARILAV ERYLKDQQLL GIWGCSGKLI CTTAVPWNAS WSNKSLEQIW 40 0 

NNMTWMEWDR EINNYTSLIH SLIEESQNQQ EKNEQELLEL DKWVNRVRQG 450 
YSPLSFQTHL PIPRGPDRPE GIEEEGGERD RDRSIRLV 488 

5 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1584 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



ATGAGTTTTG TGGTCATTAT TCCCGCGCGC TACGCGTCGA CGCGTCTGCC 50 

CGGTAAACCA TTGGTTGATA TTAACGGCAA ACCCATGATT GTTCATGTTC 10 0 

TTGAACGCGC GCGTGAATCA GGTGC CGAGC GCATCATCGT GGCAACCGAT 150 

CATGAGGATG TTGCCCGCGC CGTTGAAGCC GCTGGCGGTG AAGTATGTAT 200 

20 GACGCGCGCC GATCATCAGT CAGGAACAGA ACGTCTGGCG GAAGTTGTCG 250 

AAAAATGCGC ATTCAGCGAC GACACGGTGA TCGTTAATGT GCAGGGTGAT 3 00 

GAACCGATGA TCCCTGCGAC AATCATTCGT CAGGTTGCTG ATAAC CTCGC 3 50 

TCAGCGTCAG GTGGGTATGA CGACTCTGGC GGTGC CAATC CACAATGCGG 40 0 

AAGAAGCGTT TAAC CCGAAT GCGGTGAAAG TGGTTCTCGA CGCTGAAGGG 450 

25 TATGCACTGT ACTTCTCTCG CGCCACCATT CCTTGGGATC GTGATCGTTT 50 0 

TGCAGAAGGC CTTGAAACCG TTGGCGATAA CTTCCTGCGT CATCTTGGTA 55 0 

TTTATGGCTA CCGTGCAGGC TTTATCCGTC GTTACGTCAA CTGGCAGCCA 600 

AGTCCGTTAG AACACATCGA AATGTTAGAG CAGCTTCGTG TTCTGTGGTA 650 

CGGCGAAAAA ATCCATGTTG CTGTTGCTCA GGAAGTTCCT GGCACAGGTG 7 00 

30 TGGATACCCC TGAAGATCTC GACCCGTCGA CGAATTCTAT CGGTGGTGAC 750 

ATGAAAGACA TCTGGCGTAA CGAACTGTTC AAATACAAAG TTGTTCGTGT 80 0 

TAAAC CGTTC TCTGTTGCTC CGACCCCGAT CGCTCGTCCG GTTATCGGTA 850 

CTGGCACCCA CCGTGAAAAA CGTGCTGTAG GTC TGGGT AT GCTGTTCCTG 900 

GGCGTTCTGT CTGCAGCAGG TTCCACTATG GGTGCTGCAG CTACCGCTCT 950 

35 G AC CGTAC AG ACCCACTCTG TTATCAAAGG TATCGTACAG CAGCAGGACA 10 0 0 

ACCTGCTGCG TGCAATCCAG GC AC AGCAGG AACTGCTGCG TCTGTCTGTA 10 50 

TGGGGTATCC GTCAGCTGCG TGCTCGTCTG CTGGCACTGG AAACC CTGAT 110 0 

CCAGAACCAG CAGCTGCTGA AC CTGTGGGG CTGCAAAGGT CGTCTGATCT 1150 

GCTACACCTC CGTTAAATGG AACGAAACCT GGCGTAACAC CACCAACATC 120 0 

40 AACCAGATCT GGGGTAACCT GAC CTGGC AG GAATGGGACC AGCAGATCGA 1250 

CAACGTTTCT TCCACCATCT ACGAAGAAAT CCAGAAAGCT CAGGTTCAGC 13 0 0 

AGGAACAGAA CGAAAAAAAA CTGCTGGAAC TGGACGAATG GGCTTCTCTG 1350 

TGGAACTGGC TGGACATCAC CAAATGGCTG CGTAACATCC GTCAGGGCTA 140 0 

CCAGCCGCTG TCCCTGCAGA TCCCGACCCG TCAGCAGTCT GAAGCTGAAA 1450 

45 CTC CGGGTCG TACCGGTGAA GGTGGCGGTT CTCGCCTGCT GGCTCTGGAA 1500 

ACTCTGATTC AGAACCAGCA ACTGCTTAAC CTGTGGGGTT GCAAGGGCCG 1550 

CCTGATTTGC TACACTTCTG TAAAATGGTA ATAG 1584 

50 (2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 6 amino acids 

(B) TYPE: amino acid 

55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:97: 

MSFWIIPAR YASTRLPGKP LVDINGKPMI VHVLERARES GAERI IVATD 50 
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HEDVARAVEA AGGEVCMTRA DHQSGTERLA EWEKCAFSD DTVIVNVQGD 

EPMIPATIIR QVADNLAQRQ VGMTTLAVP I HNAEEAFNPN AVKWLDAEG 

YALYFSRATI PVJDRDRFAEG LETVGDNFLR HLGIYGYRAG FIRRYVNWQP 

SPLEHIEMLE QLRVLWYGEK IHVAVAQEVP GTGVDTPEDL DPSTNSIGGD 

MKDIWRNELF KYKWRVKPF SVAPTPIARP VIGTGTHREK RAVGLGMLFL 

GVLSAAGSTM GAAATALTVQ THSVIKGIVQ QQDNLLRAIQ AQQELLRLSV 

WGIRQLRARL LALETLIQNQ QLLNLWGCKG RLICYTSVKW NETWRNTTNI 

NQIWGNLTWQ EWDQQIDNVS STIYEEIQKA QVQQEQNEKK LLELDEWASL 

WNWLDITKWL RNIRQGYQPL SLQIPTRQQS EAETPGRTGE GGGSRLLALE 
TLIQNQQLLN LWGCKGRL I C YTSVKW 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
GACCGTCCGG AAGGTATCGA AGAAGAAGGC GGCGAACGTG ACCGTGACCG TTCCATTCGT 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:99: 
ATGGAACGGT CACGGTCACG TTCGCCGCCT TCTTCTTCGA TACCTTCCGG ACG 



(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 0: 
ATCTCTGGAA CAGATCTGGA 



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
AGTACTGAAG CAGATTCCAC 



100 
150 
200 
250 
300 
350 
400 
450 
500 
526 
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(2) INFORMATION FOR SEQ ID NO: 102: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
CCGTCGTTAC GTCAACTGG 



(2) INFORMATION FOR SEQ ID NO:103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
CGCCGTTGGT ATCGGTGC 



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: XXX base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 
TACCAGACAG AAGCTGACG 



(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105 
CTTCGATCAG AGAGTGGATC 



(2) INFORMATION FOR SEQ ID NO: 10 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 6: 
GACGATCTGC GTTCTCTGTG 

5 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1800 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



20 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 



50 



ATGAGTTTTG TGGTCATTAT TCCCGCGCGC TACGCGTCGA CGCGTCTGCC 

CGGTAAACCA TTGGTTGATA TTAACGGCAA ACCCATGATT GTTCATGTTC 10 0 

TTGAACGCGC GCGTGAATCA GGTGCCGAGC GCATCATCGT GGCAACCGAT 150 

CATGAGGATG TTGCCCGCGC CGTTGAAGCC GCTGGCGGTG AAGTATGTAT 200 

20 GACGCGCGCC GATCATCAGT CAGGAACAGA ACGTCTGGCG GAAGTTGTCG 250 

AAAAATGCGC ATTCAGCGAC GACACGGTGA TCGTTAATGT GCAGGGTGAT 300 

GAACCGATGA TCCCTGCGAC AATCATTCGT CAGGTTGCTG ATAACCTCGC 3 b U 

TCAGCGTCAG GTGGGTATGG CGACTCTGGC GGTGCCAATC CACAATGCGG 40 0 
AAGAAGCGTT TAACCCGAAT GCGGTGAAAG TGGTTCTCGA CGCTG AAGGG 
25 TATGCACTGT ACTTCTCTCG CGCCACCATT CCTTGGGATC GTGATCGTTT 
TGCAGAAGGC CTTGAAACCG TTGGCGATAA CTTCCTGCGT CATCTTGGTA 

TTTATGGCTA CCGTGCAGGC TTTATCCGTC GTTACGTCAA CTGGCAGCCA &0 0 
AGTCCGTTAG AACACATCGA AATGTTAGAG CAGCTTCGTG TTCTGTGGTA 
CGGCGAAAAA ATCCATGTTG CTGTTGCTCA GGAAGTTCCT GGCACAGGTG 
30 TGGATACCCC TGAAGATCCG TCGACAGCCC TTATGAAGAT CCCCGGCGAC 
CCGGGTGGTG GTGACATGCG TGACAACTGG CGTTCTGAAC TGTACAAATA 
CAAAGTTGTT AAAATCGAAC CGCTGGGTGT TGCTCCGACT AAAGCTAAAC 

GTCGTGTTGT TCAGCGTGAA AAACGCGCCG TTGGTATCGG TGCACTGTTC 900 

CTGGGTTTCC TGGGTGCTGC TGGTTCTACC ATGGGTGCTG CTTCTATGAC 950 

35 CCTGACTGTT CAGGCCCGTC AGCTTCTGTC TGGTATCGTT CAGCAGCAGA 1000 

ACAATCTGCT GCGTGCTATC GAAGCTCAGC AGCATCTGCT GCAACTGACC 105 0 

GTTTGGGGTA TCAAACAGCT TCAGGCTCGT ATCCTGGCTG TTGAACGTTA 110 0 

CCTGAAAGAC CAGCAGCTGC TGGGTATCTG GGGTTGCTCT GGTAAACTGA 1150 

TCTGCACTAC TGCTGTTCCG TGGAACGCTT CTTGGTCTAA CAAATCTCTG 12 0 0 

40 GAACAGATCT GGAACAACAT GACTTGGATG GAATGGGACC GTGAAATCAA 1250 

CAACTACACA AGCTTGATCC ACTCTCTGAT CGAAGAAAGC CAGAACCAGC 1J0 0 

AGGAAAAAAA CGAACAGGAA CTTCTAGAAC TGGACAAATG GGTTAACCGT 13 50 

GTTCGTCAGG GTTACTCTCC GCTGTCTTTC CAGACCCATC TGCCGATCCC 140 0 

GCGTGGTCCG GACCGTCCGG AAGGTATCGA AGAAGAAGGC GGCGAACGTG 1450 

45 ACCGTGACCG TTCCATTCGT CTGGTAAACG GTTCTCTGGC TCTGATCTGG 1500 

GACGATCTGC GTTCTCTGTG CCTGTTCTCT TACCACCGTC TGCGTGATCT 1550 

GCTGCTGATC GTGACTCGTA TCGTTGAACT GCTCGGCCGT CGTGGTTGGG 160 0 

AAGCTCTGAA AT AC TGGTGG AATCTGCTTC AGT AC TGGTC CCAGGAACTG 1650 

AAAAACTCTG CTGTTTCTCT GCTGAACGCT ACTGCTATCG CTGTTGCTGA IV 0 0 

50 AGGCACCGAT CGTGTTATCG AAGTAGTTCA GGGTGCTTAC CGTGCTATCC 1750 

GTCACATTCC GCGTCGTATC CGTCAGGGTC TGGAACGTAT CCTGCTGTAA 18 0 0 



450 
500 
550 



650 
700 
750 
800 
850 



(2) INFORMATION FOR SEQ ID NO: 108: 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

MSFWIIPAR YASTRLPGKP LVDINGKPMI VHVLERARES GAERIIVATD 50 

5 HEDVARAVEA AGGEVCMTRA DHQSGTERLA EWEKCAFSD DTVIVNVQGD 10 0 

EPMIPATIIR QVADNLAQRQ VGMATLAVPI HNAEEAFNPN AVKWLDAEG 150 

YALYFSRATI PWDRDRFAEG LETVGDNFLR HLGIYGYRAG FIRRYVNWQP 200 

SPLEHIEMLE QLRVLWYGEK IHVAVAQEVP GTGVDTPEDP STALMKIPGD 250 

PGGGDMRDNW RSELYKYKW KIEPLGVAPT KAKRRWQRE KRAVGIGALF 3 00 

10 LGFLGAAGST MGAASMTLTV QARQLLSGIV QQQNNLLRAI EAQQHLLQLT 3 50 

VWGIKQLQAR ILAVERYLKD QQLLGIWGCS GKLICTTAVP WNASWSNKSL 400 

EQIWNNMTWM EWDREINNYT SLIHSLIEES QNQQEKNEQE LLELDKWVNR 450 

VRQGYSPLSF QTHLPIPRGP DRPEGIEEEG GERDRDRSIR LVNGSLALIW 50 0 

DDLRSLCLFS YHRLRDLLLI VTRIVELLGR RGWEALKYWW NLLQYWSQEL 550 

15 KNSAVSLLNA T A I AVAEGTD RVIEWQGAY RAIRHIPRRI RQGLERILL 599 



(2) INFORMATION FOR SEQ ID NO: 109: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:109: 
GACTACTTGT AGCCATTCGT CTGGTAATCG GTGGTGACAT GAAAGAC 



(2) INFORMATION FOR SEQ ID NO:110: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

40 

ACAATGATGG TACCTATTAT TCACCGGTAC GAC 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 111: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

18 



ATTGGTTGAT ATTAACGG 



(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
TCGGTGGTGA CATGAAAGAC 



(2} INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
AAAATAGGCG TATCACGAGG 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
ACAATGATGG TAC CTATTAC AGCCATTTGG TGATGTC C AG 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
TAACGATCAG CTACCGGTGA AGGTCCGGGT GGTGGTGACA TGCGTG 



(2) INFORMATION FOR SEQ ID NO: 11 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
CAAGATGGAT CCTATTATAC CAGACGAATG GAACGGTC 



(2) INFORMATION FOR SEQ ID NO: 117: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 122 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

CCGGTGAAGG TGGCGGTTCT CGCCTGCTGG CTCTGGAAAC TCTGATTCAG AACCAGCAAC 
TGCTTAACCT GTGGGGTTGC AAGGGCCGCC TGATTTGCTA CACTTCTGTA AAATGGTAAT 
10 AG 



(2) INFORMATION FOR SEQ ID NO: 118: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 



GATCCTATTA CCATTTTACA GAAGTGTAGC AAATCAGGCG GCCCTTGCAA CCCCACAGGT 
TAAGCAGTTG CTGGTTCTGA ATCAGAGTTT CCAGAGCCAG CAGGCGAGAA CCGCCACCTT 
25 CA 



60 
120 
122 



(2) INFORMATION FOR SEQ ID NO: 119: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 849 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 



ATGATCGGTG GTGACATGAA AGACATCTGG CGTAACGAAC TGTTCAAATA 50 

CAAAGTTGTT CGTGTTAAAC CGTTCTCTGT TGCTCCGACC CCGATCGCTC 10 0 

40 GTCCGGTTAT CGGTACTGGC ACCCACCGTG AAAAACGTGC TGTAGGTCTG 150 

GGTATGCTGT TCCTGGGCGT TCTGTCTGCA GCAGGTTCCA CTATGGGTGC 20 0 

TGCAGCTACC GCTCTGACCG TACAGACCCA CTCTGTTATC AAAGGTATCG 2 50 

TACAGCAGCA GGACAACCTG CTGCGTGCAA TCCAGGCACA GCAGGAACTG 30 0 

CTGCGTCTGT CTGTATGGGG TATCCGTCAG CTGCGTGCTC GTCTGCTGGC 35 0 

45 ACTGGAAACC CTGATCCAGA ACCAGCAGCT GCTGAACCTG TGGGGCTGCA 40 0 

AAGGTCGTCT GATCTGCTAC ACCTCCGTTA AATGGAACGA AAC CTGGCGT 450 

AACACCACCA ACATCAACCA GATCTGGGGT AACCTGACCT GGCAGGAATG 50 0 

GGACCAGCAG ATCGACAACG TTTCTTCCAC CATCTACGAA GAAATCCAGA ^bU 

AAGCTCAGGT TCAGCAGGAA CAGAACGAAA AAAAACTGCT GG AAC TGG AC 60 0 

50 GAATGGGCTT CTCTGTGGAA CTGGCTGGAC ATCACCAAAT GGCTGCGTAA 650 

CATCCGTCAG GGCTACCAGC CGCTGTCCCT GCAGATCCCG ACCCGTCAGC 700 

AGTCTGAAGC TGAAACTCCG GGTCGTACCG GTGAAGGTGG CGGTTCTCGC 75 0 

CTGCTGGCTC TGGAAACTCT GATTCAGAAC CAGCAACTGC TTAACCTGTG 800 

GGGTTGCAAG GGCCGCCTGA TTTGCTACAC TTCTGTAAAA TGGTAATAG 849 

55 

(2) INFORMATION FOR SEQ ID NO: 12 0: 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 281 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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10 



(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

MIGGDMKDIW RNELFKYKW RVKPFSVAPT PIARPVIGTG THREKRAVGL 50 

GMLFLGVLSA AGSTMGAAAT ALTVQTHSVI KGIVQQQDNL LRAIQAQQEL 10 0 

LRLSVWGIRQ LRARLLALET LIQNQQLLNL WGCKGRLICY TSVKWNETWR 150 

NTTNINQIVJG NLTWQEWDQQ IDNVSSTIYE EIQKAQVQQE QNEKKLLELD 200 

EWASLWNWLD ITKWLRNIRQ GYQPLSLQIP TRQQSEAETP GRTGEGGGSR 250 

LLALETLIQN QQLLNLWGCK GRLICYTSVK W 281 



15 (2) INFORMATION FOR SEQ ID NO:121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

22 

25 ACGTTCGCCG CCTTCTTCTT CG 
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What is claimed is: 

1 An isolated HIV- 1 Group O env polypeptide having an amino acid sequence 
consisting essentially of the sequence of Figure 1 (SEQ ID NO:61). 

2. An isolated HIV-1 Group O env polypeptide comprising an immunoreactive 
portion of a polypeptide according to Claim 1. 

3 . A polynucleotide encoding a polypeptide according to any of Claims 1 and 2. 

4. An antigen construct comprising a first HIV-1 Group O env polypeptide fused 
to a second HIV-1 Group O env polypeptide. 

5 . An antigen construct according to Claim 4 wherein the first polypeptide is a 
gpl20 polypeptide, and the second polypeptide is a gp41 polypeptide. 

6 . An antigen construct according to Claim 5 wherein a portion of the hydrophobic 
region of the gp41 polypeptide has been deleted. 

7 . An antigen construct according to any of Claims 4, 5 and 6 wherein at least one 
of the first and second HIV-1 Group O env polypeptides is derived from HIV-1 Group O 
isolate HAM1 12. 

8 . An antigen construct according to Claim 4 wherein the first polypeptide 
comprises an immunoreactive portion of the gpl20 protein of HIV-1 Group O isolate 
HAM112. 

9 . An antigen construct according to Claim 8 wherein the first polypeptide has an 
amino acid sequence consisting essentially of residues 1 through 520 of the sequence of 
Figure 1 (SEQ ID NO:61) or a portion thereof. 

10. An antigen construct according to Claim 9 wherein the first polypeptide has an 
amino acid sequence consisting essentially of residues 476 through 520 of the sequence of 
Figure 1 (SEQ ID NO:61). 
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11. An antigen construct according to any of Claims 4, 8, 9 and 10 wherein the 
second polypeptide comprises an immunoreactive portion of the gp41 protein of HIV-1 Group 
O isolate HAM1 12. 

12. An antigen construct according to Claim 1 1 wherein the second polypeptide has 
an amino acid sequence consisting essentially of residues 521 through 873 of the sequence of 
Figure 1 (SEQ ID NO:61) or a portion thereof. 

13. An antigen construct according to Claim 1 1 wherein a portion of the 
hydrophobic region of the gp41 protein of HIV- 1 Group O isolate HAM1 12 is absent from the 
second polypeptide. 

14. An antigen construct according to Claim 1 1 wherein the deleted portion is that 
part of gp41 having an amino acid sequence consisting essentially of residues 690 through 715 
of the sequence of Figure 1 (SEQ ID NO:61). 

15. An antigen construct according to Claim 1 1 wherein the second polypeptide has 
an amino acid sequence consisting essentially of residues 47 through 373 of Figure 9 (SEQ ED 
NO:52). 

16. An antigen construct according to Claim 1 1 wherein the second polypeptide has 
an amino acid sequence consisting essentially of residues 47 through 245 of Figure 7 (SEQ ID 
NO:48). 

17. An antigen construct according to Claim 1 1 wherein the second polypeptide has 
an amino acid sequence consisting essentially of residues 47 through 215 of Figure 5 (SEQ ID 
NO:58). 

18. An antigen construct according to Claim 4 selected from the group consisting of 
P GO-8PL, pGO-8CKS, pGO-9PL, pGO-9CKS, pGO-HPL and pGO-UCKS, and 
derivatives, variants and analogs thereof. 

19. An antigen construct comprising a fusion of at least one HIV-1 Group O env 
polypeptide with at least one HIV-1 Group M env polypeptide. 
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20 . An antigen construct according to Claim 19 comprising a fusion of: 

(a) a first HTV- 1 Group O env polypeptide; 

(b) a second HIV- 1 Group O env polypeptide; 

(c) a first HTV-1 Group M env polypeptide; and 

(d) a second HIV- 1 Group M env polypeptide. 

21. An antigen construct according to Claim 20 wherein the HTV- 1 Group M 
polypeptides are derived from an HJV-1 isolate of Subtype B. 

22. An antigen construct according to Claim 21 wherein at least one of the HTV-1 
Group M polypeptides is derived from HIV-1 Group M isolate HXB2R. 

23 . An antigen construct according to any of Claims 20, 21 and 22 wherein at least 
one of the HIV-1 Group O sequences is derived from HIV-1 Group O isolate HAM112. 

24 . An antigen construct according to Claim 20 wherein the first Group O env 
polypeptide and the first Group M env polypeptide are both gpl20 polypeptides, and the 
second Group O env polypeptide and the second Group M env polypeptide are both gp41 
polypeptides. 

25 . An antigen construct according to Claim 24 wherein a portion of the 
hydrophobic region of at least one of the gp41 polypeptides has been deleted. 

26. An antigen construct according to Claim 20 wherein: 

(a) the first HIV- 1 Group O env polypeptide comprises an immunoreactive 
portion of the gpl20 protein of HTV-1 Group O isolate HAM1 12; 

(b) the second HIV- 1 Group O env polypeptide comprises an 
immunoreactive portion of the gp41 protein of HTV-1 Group O isolate HAM1 12 

(c) the first HTV- 1 Group M env polypeptide comprises an immunoreactive 
portion of the gpl20 protein of a first HTV-1 Group M isolate of Subtype B; and 

(d) the second HIV- 1 Group M env polypeptide comprises an 
immunoreactive portion of the gp41 protein of a second HTV-1 Group M isolate of Subtype B. 

27 . An antigen construct according to Claim 26 wherein the first and second HTV-1 
Group M isolates of Subtype B are the same and are HIV-1 Group M isolate HXB2R. 
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28 . An antigen construct according to Claim 27 wherein a portion of the 
hydrophobic region of the gp41 protein of HTV-1 Group M isolate HXB2R is absent from the 
second HIV-1 Group M env polypeptide. 

29. An antigen construct according to Claim 26 wherein: 

(a) the first fflV- 1 Group M env polypeptide has an amino acid sequence 
consisting essentially of residues 251 through 292 of the sequence of Figure 12 (SEQ ID 
NO: 108); and 

(b) the second HIV-1 Group M env polypeptide has an amino acid sequence 
consisting essentially of residues 293 through 599 of the sequence of Figure 12 (SEQ ID 

NO: 108) or a portion thereof. 

30 . An antigen construct according to Claim 29 wherein the second HIV- 1 Group 
M env polypeptide has an amino acid sequence consisting essentially of residues 293 through 
492 of the sequence of Figure 12 (SEQ ID NO: 108). 

31. An antigen construct according to any of Claims 20, 24, 25, 26, 27, 28, 29 and 
30 wherein the first HIV-1 Group O env polypeptide has an amino acid sequence consisting 
essentially of residues 1 through 520 of the sequence of Figure 1 (SEQ ID NO:61) or a portion 
thereof. 

32. An antigen construct according to Claim 3 1 wherein the second HIV-1 Group O 
env polypeptide has an amino acid sequence consisting essentially of residues 521 through 873 
of the sequence of Figure 1 (SEQ ID NO:61) or a portion thereof. 

33. An antigen construct according to Claim 3 1 wherein the first HIV- 1 Group O 
env polypeptide has an amino acid sequence consisting essentially of residues 476 through 520 
of the sequence of Figure 1 (SEQ ID NO:61). 

34. An antigen construct according to Claim 33 wherein a portion of the 
hydrophobic region of the gp41 protein of fflV-1 Group O isolate HAM112 is absent from the 
second HIV-1 Group O env polypeptide. 

35. An antigen construct according to Claim 33 wherein the second HIV- 1 Group O 
env polypeptide has an amino acid sequence consisting essentially of residues 47 through 373 
of Figure 9 (SEQ ID NO:52). 
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36. An antigen construct according to Claim 33 wherein the second HIV-1 Group O 
env polypeptide has an amino acid sequence consisting essentially of residues 47 through 245 
of Figure 7 (SEQ ID NO:48). 

37 . An antigen construct according to Claim 33 wherein the second HIV- 1 Group O 
env polypeptide has an amino acid sequence consisting essentially of residues 47 through 215 
of Figure 5 (SEQ ID NO:58). 

38 . An antigen construct according to Claim 19 selected from the group consisting 
of pGO-12CKS, pGO-13CKS and pGO-14PL, and derivatives, variants and analogs thereof. 

39. An antigen construct comprising a fusion of a first HIV-1 env polypeptide, a 
second HTV-1 env polypeptide, and at least one additional HIV-1 polypeptide. 

40. An antigen construct according to Claim 39 wherein the first HIV-1 env 
polypeptide and the second HTV-1 env polypeptide are each HIV-1 Group O polypeptides. 

41. An antigen construct according to Claim 40 wherein the first HTV- 1 Group O 
env polypeptide is a gpl20 polypeptide, and the second HIV-1 Group O env polypeptide is a 
gp41 polypeptide. 

42. An antigen construct according to Claim 41 wherein the first HIV-1 Group O 
env polypeptide comprises an immunoreactive portion of the gpl20 protein of HIV-1 Group O 
isolate HAM112, and the second HTV-1 Group O env polypeptide comprises an 
immunoreactive portion of the gp41 protein of HTV-1 Group O isolate HAM1 12. 

43 . An antigen construct according to Claim 42 wherein the first HIV- 1 Group O 
env polypeptide has an amino acid sequence consisting essentially of residues 1 through 520 of 
the sequence of Figure 1 (SEQ TD NO:61) or a portion thereof. 

44. An antigen construct according to Claim 43 wherein the second HIV- 1 Group O 
env polypeptide has an amino acid sequence consisting essentially of residues 521 through 873 
of the sequence of Figure 1 (SEQ ID NO:6 1) or a portion thereof. 

45 . An antigen construct according to Claim 43 wherein the first HIV- 1 Group O 
env polypeptide has an amino acid sequence consisting essentially of residues 476 through 520 
of the sequence of Figure 1 (SEQ ID NO:61). 
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46 . An antigen construct according to Claim 45 wherein a portion of the 
hydrophobic region of the gp41 protein of fflV-1 Group O isolate HAM1 12 is absent from the 
second fflV-1 Group O env polypeptide. 

47 . An antigen construct according to Claim 45 wherein the second HIV- 1 Group O 
env polypeptide has an amino acid sequence consisting essentially of residues 47 through 373 
of Figure 9 (SEQ ID NO:52). 

48 . An antigen construct according to Claim 45 wherein the second HIV- 1 Group O 
env polypeptide has an amino acid sequence consisting essentially of residues 47 through 245 
of Figure 7 (SEQ ID NO:48). 

49. An antigen construct according to Claim 45 wherein the second HIV-1 Group O 
env polypeptide has an amino acid sequence consisting essentially of residues 47 through 215 
of Figure 5 (SEQ ID NO:58). 

50. An antigen construct according to any of Claims 40, 41, 42, 43, 44, 45, 46, 
47, 48 and 49 wherein the additional HTV-1 polypeptide is a Group O env polypeptide. 

51. An antigen construct according to Claim 50 wherein the additional FflV-1 
Group O polypeptide comprises an immunoreaclive portion of the gp41 protein of HIV-1 
Group O isolate HAM112. 

52. An antigen construct according to Claim 5 1 wherein the additional FflV-1 
Group O polypeptide has an amino acid sequence consisting essentially of residues 521 
through 873 of the sequence of Figure 1 (SEQ ID NO:61) or a portion thereof. 

53. An antigen construct according to Claim 52 wherein a portion of the 
hydrophobic region of the gp41 protein of HIV-1 Group O isolate HAM112 is absent from the 
additional HIV-1 Group O polypeptide. 

54. An antigen construct according to Claim 50 wherein the additional HTV- 1 
Group O env polypeptide has an amino acid sequence consisting essentially of residues 47 
through 373 of Figure 9 (SEQ ID NO:52). 

55. An antigen construct according to Claim 50 wherein the additional HIV- 1 
Group O env polypeptide has an amino acid sequence consisting essentially of residues 47 
through 245 of Figure 7 (SEQ ID NO:48). 
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56. An antigen construct according to Claim 50 wherein the additional HIV-1 
Group O env polypeptide has an amino acid sequence consisting essentially of residues 47 
through 215 of Figure 5 (SEQ ID NO:58). 

57 . An antigen construct according to Claim 50 wherein the additional HIV- 1 
Group O env polypeptide has an amino acid sequence consisting essentially of residues 250 
through 281 of Figure 17 (SEQ ID NO: 120). 

58. An antigen construct according to Claim 40 selected from the group consisting 
of pGO-15CKS and pGO-15PL, and derivatives, variants and analogs thereof. 

59. An antigen construct comprising a first HTV-2 env polypeptide fused to a 
second HIV-2 env polypeptide. 

60. An antigen construct according to Claim 59 wherein the first HIV-2 env 
polypeptide is a gpl20 polypeptide, and the second HIV-2 env polypeptide is a gp36 
polypeptide. 

61. An antigen construct according to Claim 60 wherein: 

(a) the first HTV-2 env polypeptide has an amino acid sequence consisting 
essentially of residues 248 through 307 of the sequence of Figure 11 (SEQ ID NO:55) or a 

portion thereof; and 

(b) the second HIV-2 env polypeptide has an amino acid sequence 
consisting essentially of residues 308 through 466 of the sequence of Figure 11 (SEQ ID 
NO:55) or a portion thereof. 

62. An antigen construct according to Claim 59 which is pHTV-210, and 
derivatives, variants and analogs thereof. 

63 . A polynucleotide encoding an antigen construct according to any of Claims 4, 
18, 19, 38, 39, 58, 59 and 62. 

64. A polynucleotide according to Claim 63 operably linked to a control sequence 
capable of directing expression in a suitable host. 

65. A polynucleotide according to Claim 63 wherein the coding sequence is 
modified to provide a codon bias appropriate to the expression host. 
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66 . An expression vector comprising a polynucleotide according to Claim 63. 

67. A host cell transformed by an expression vector according to Claim 66. 

68 . A host cell according to Claim 67 wherein the host is Escherichia coll 

69. A method for detecting antibodies to HIV-1 in a test sample comprising the 
steps of: 

(a) combining at least one antigen construct according to any of Claims 4, 

18, 19, 38, 39, 58, 59 and 62 with the test sample to form a mixture; 

(b) incubating the mixture under conditions suitable for formation of 
complexes between the antigen and antibodies, if any, which are present in the sample and are 
immunologically reactive with the antigen; and 

(c) detecting the presence of any complexes formed. 

70. A method according to Claim 69 wherein detecting the presence of complexes in 
step (c) is carried out using an additional antigen construct according to any of Claims 4, 18, 

19, 38, 39, 58, 59 and 62 to which is attached a signal- generating compound. 

71. A method according to Claim 69 wherein detecting the presence of complexes in 
step (c) is carried out using an additional antigen construct according to any of Claims 4, 18, 
19, 38, 39, 58, 59 and 62 to which is attached a first member of a specific binding pair, and 
further using an indicator reagent comprising a second member of the specific binding pair to 
which is attached a signal-generating compound. 

72. A method according to Claim 69 wherein detecting the presence of complexes in 
step (c) is carried out using an antibody directed to the complexes formed in step (b) to which 
is attached a signal-generating compound. 

73. A method according to Claim 69 wherein detecting the presence of complexes in 
step (c) is carried out using an antibody directed to the complexes formed in step (b) to which 
is attached a first member of a specific binding pair, and further using an indicator reagent 
comprising a second member of the specific binding pair to which is attached a signal- 
generating compound. 

74. An immunoassay kit for the detection of antibodies to HIV-1 comprising an 
antigen construct according to any of Claims 4, 18, 19, 38, 39, 58, 59 and 62. 
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75. An immunoassay kit according to Claim 74 wherein the antigen construct is a 
capture reagent 

76. An immunoassay kit according to Claim 74 wherein the antigen construct is an 
indicator reagent 

77 . An immunoassay kit according to Claim 74 wherein the antigen construct is 
attached to a first member of a specific binding pair, the kit additionally comprising an indicator 
reagent comprising a second member of the specific binding pair attached to a signal-generating 
compound. 
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Abstract of the Disclosure 



Isolated HIV-1 Group O env polypeptides obtained from the HIV-1 isolate HAM1 12 
are claimed, as well as (a) antigen constructs comprising fusions of one or more of each of 
HIV-1 Group O env polypeptides and HIV-1 Group M env polypeptide and (b) further antigen 
constructs containing additional Group O sequences and especially the gp41 IDR of isolate 
HAM1 12. Also claimed are polynucleotide sequences encoding the above, expression vectors 
comprising the same, host cells transformed thereby, and immunoassay methods and kits 
utilizing the antigen constructs of the invention. 
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Figure 13 



r^-CKS 

'msfvviipar yastrlpgkp lvdingkpmi VHVLERARES GAERIIVATD 50 

HEDVARAVEA AGGEVCMTRA DHQSGTERLA EVVEKCAFSD DTVIVNVQGD 100 

EPMIPATIIR QVADNLAQRQ VGMATLAVPI HNAEEAFNPN AVKVVLDAEG 150 

YALYFSRATI PWDRDRFAEG LETVGDNFLR HLGIYGYRAG FIRRYVNWQP 200 

SPLEHI EMLE QLRVLWYGEK IHVAVAQEVP GTGVDTPEDP STALMKIPGD 250 

gpl 20 (HXB2R) r>- gp41 (HXB2R) 

' PGGGDMRDNW RSELYKYKVV KIEPLGVAPT KAKRRVVQRE KR'AVGIGALF 300 

LGFLGAAGST MGAASMTLTV QARQLLSGIV QQQNNLLRAI EAQQHL LQLT 350 

VWGIKQLQAR ILAVERYLKD QQLLGIWGCS GKLICTTAVP WNASWSNKSL 400 

EQIWNNMTWM EWDREINNYT SLIHSLIEES QNQQEKNEQE L L E LDKWVNR 450 

r^gp120 (HAM112) 

VRQGYSPLSF QTHLPIPRGP DRPEGI EEEG GERDRDRSIR L VI GGDMKDI 500 

r^gp41 (HAM112) 

WRNELFKYKV VRVKPFSVAP TPIARPVIGT GTHREKR'AVG LGMLFLGVLS 550 

AAGSTMGAAA TALTVQTHSV IKGIVQQQDN LLRAIQAQQE LLRLSVWGIR 600 

QLRARLLALE TLIQNQQLLN LWGCKGRLIC YTSVKWNETW RNTTNINQIW 650 

GNLTWQEWDQ QIDNVSSTIY EEIQKAQVQQ EQNEKKL LEL DEWASLWNWL 700 

DITKWL'RNIR QGYQPLSLQI PTRQQSEAET PGRTGE 736 



Figure 14 



'msfvviipar YASTRLPGKP LVDINGKPMI VHVLERARES GAERIIVATD 50 

HEDVARAVEA AGGEVCMTRA DHQSGTERLA EVVEKCAFSD DTVIVNVQGD 100 

EPMIPATIIR QVADNLAQRQ VGMATLAVPI HNAEEAFNPN AVKVVLDAEG 150 

YALYFSRATI PWDRDRFAEG LETVGDNFLR HLGIYGYRAG FIRRYVNWQP 200 

SPLEHIEMLE QLRVLWYGEK IHVAVAQEVP GTGVDTPEDP STALMKIPGD 250 

r> gp1 20 (HXB2R) [> 9P41 (HXB2R) 

' PGGGDMRDNW RSELYKYKVV KIEPLGVAPT KAKRRVVQRE KR'AVGIGALF 300 

LGFLGAAGST MGAASMTLTV QARQLLSGIV QQQNNLLRAI EAQQHLLQLT 350 

VWGIKQLQAR I LAVERYLKD QQLLGIWGCS GKLICTTAVP WNASWSNKSL 400 

EQIWNNMTWM EWDREINNYT SLIHSLIEES QNQQEKNEQE LLELDKWVNR 450 

r>-gp120 (HAM112) 

VRQGYSPLSF QTHLPIPRGP DRPEGIEEEG GERDRDRSIR Lv'lGGDMKDI 500 

r>gp41 (HAM112) 

WRNELFKYKV VRVKPFSVAP TPIARPVIGT GTHREKR'AVG LGMLFLGVLS 550 

AAGSTMGAAA TALTVQTHSV IKGIVQQQDN LLRAIQAQQE LLRLSVWGIR 600 

QLRARLLALE TLIQNQQLLN LWGCKGRLIC YTSVKWNETW RNTTNINQIW 650 

GNLTWQEWDQ QIDNVSSTIY EEIQKAQVQQ EQNEKKLLEL DEWASLWNWL 700 

DITKWli 706 



Figure 15 



j>-gp120 (HAM112) r^-gp41 (HAM1 1 2) 

MlGGDMKDIW RNELFKYKVV RVKPFSVAPT PIARPVIGTG THREKR'AVGL 50 

GMLFLGVLSA AGSTMGAAAT ALTVQTHSVI KGIVQQQDNL LRAIQAQQEL 100 

LRLSVWGIRQ LRARLLALET LIQNQQLLNL WGCKGRLICY TSVKWNETWR 150 

NTTNINQIWG NLTWQEWDQQ IDNVSSTIYE EIQKAQVQQE QNEKKLLELD 200 

r^gpl 20 (HXB2R) 

EWASLWNWLD ITKWLIRNIRQ GYQPLSLQIP TRQQSEAETP GRTGEG'PGGG 250 

j»-gp41 (HXB2R) 

DMRDNWRSEL YKYKVVKIEP LGVAPTKAKR RVVQREKR'AV GIGALFLGFL 300 
GAAGSTMGAA SMTLTVQARQ LLSGIVQQQN NLLRAIEAQQ HLLQLTVWGI 350 
KQLQARILAV ERYLKDQQLL GIWGCSGKLI CTTAVPWNAS WSNKSLEQIW 400 
NNMTWMEWDR EINNYTSLIH SLIEESQNQQ EKNEQELLEL DKWVNRVRQG 450 
YSPLSFQTHL PIPRGPDRPE GIEEEGGERD RDRSIRLV 488 



Figure 16 



>CKS 

MSFVVIIPAR YASTRLPGKP LVDINGKPMI VHVLERARES GAERIIVATD 50 
HEDVARAVEA AGGEVCMTRA DHQSGTERLA EVVEKCAFSD DTVIVNVQGD 100 
EPMIPATIIR QVADNLAQRQ VGMTTLAVPI HNAEEAFNPN AVKVVLDAEG 150 
YALYFSRATI PWDRDRFAEG LETVGDNFLR HLGIYGYRAG FIRRYVNWQP 200 

r>-gp120 (HAM112) 

SPLEHIEMLE QLRVLWYGEK IHVAVAQEVP GTGVDTPEDL DPSTNSIGGD 250 

r>gp41 (HAM112) 

MKDIWRNELF KYKVVRVKPF SVAPTPIARP VIGTGTHREK R'AVGLGMLFL 300 
GVLSAAGSTM GAAATALTVQ THSVIKGIVQ QQDNLLRAIQ AQQELLRLSV 350 
WGIRQLRARL LALETLIQNQ QLLNLWGCKG RLICYTSVKW NETWRNTTNI 400 
NQIWGNLTWQ EWDQQIDNVS STIYEEIQKA QVQQEQNEKK LLELDEWASL 450 

^linker r>-IDR (HAM112) 

WNWLDITKWL RNIRQGYQPL SLQIPTRQQS EAETPGRTGE GGGSKLLALE 500 



TLIQNQQLLN LWGCKGRLIC YTSVKW 



526 



Figure 17 



r>gp120 (HAM112) r>gp41 (HAM1 12) 

M1GGDMKDIW RNELFKYKVV RVKPFSVAPT PIARPVIGTG THREKRAVGL 50 

GMLFLGVLSA AGSTMGAAAT ALTVQTHSVI KGIVQQQDNL LRAIQAQQEL 100 

LRLSVWGIRQ LRARL LALET LIQNQQLLNL WGCKGRLICY TSVKWN ETWR 150 

NTTNINQIWG NLTWQEWDQQ IDNVSSTIYE EIQKAQVQQE QNEKKLLELD 200 

Pinker pDR (HAM 11 2) 

EWASLWNWLD ITKWLRNIRQ GYQPLSLQIP TRQQSEAETP GRTGEGGGSR 250 
LLALETLIQN QQLLNLWGCK GRLICYTSVK W 281 



Lll 



< 
IE 

CO 
CM 



00 



i- 

o 



^ o 

O o 

r: o 



< £ 
co E 

0) CO 




CO 
CO 
CM 




CO 

_J 

CL 

> 

1 X 



o 
o 

CD 



O 
O 



o 
o 

CM 



o 
o 
o 



o 
o 

CO 



o 
o 

CO 



o 
o 



o 
o 

CM 



O 
O 
O 



06t? QO uboiai 




ooooooooooo 
ooooooooooo 

OCOCD^CMOCOCO-sl-CMq 

06fr QO ueaiAi 




Q_ 

CO 
LU 



< 
X 

CD 
CM 




ooooooooooo 
ooooooooooo 
ococo^-cnjococottcmo 

C\ii-T--r~T-T-00000 

06fr QO UB3IAI 




oooooooooo 
oooooooooo 

COCD^j-OJOCOCD^fOJO 

06P QO ueaw 



r 




CO 
UJ 






ooooooooooo 
ooooooooooo 
ooocd^-cmococottojo 

C\Jt-t-i-t-t— ooooo 

06P QO uea|/\| 



PATENT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Certificate of Mailing Under 37 C.F.R. 61 .10 
Express Mail Label No. EH 824 371 060 US 

I hereby certify that this paper (along with any paper 
referred to as being attached or enclosed) is being 
deposited with the United States Postal Service on the 
date shown below as Express Mail Post Office to 
Addressee Service addressed to: 

Box Patent Application 
Commissioner of Patents & Trademarks 
Washington, D.C. 20231 

Date of Deposit: August 15, 1997 
Andreas M. Danckers Date 



nm ARATION AND PQWEP OF ATTORNEY FOR A UNITED STATES PATENT APPLICATION 

Box Patent Application 

Commissioner of Patents and Trademarks 

Washington, D.C. 20231 

As a below-named inventor, I hereby declare that my residence, post office address and 
citizenship are as stated below next to my name. I believe that I am an original and first and 
joint inventor of the subject matter of the invention entitled NOVEL ANTIGEN CONSTRUCTS 
USEFUL IN THE DETECTION AND DIFFERENTIATION OF ANTIBODIES TO HIV which is claimed and 
for which a patent is sought in the patent application attached hereto. 

I hereby state that I have reviewed and understand the contents of the above-mentioned 
specification, including the claims. 

I acknowledge a duty to disclose to the Patent and Trademark Office all information known to me 
to be material to patentability as defined in Title 37, Code of Federal Regulations, §1 .56. 

Claim to benefit of foreign application(s): 

I hereby claim foreign priority benefits under 35 U.S.C. §119 for the following 
foreign application(s) for patent or inventor's certificate: 

NONE 

The following foreign applications for patent or inventor's certificate have a 
filing date earlier than the filing date of the application(s) identified above: 



Applicant(s): John R. Hackett, Jr. era/. 
Serial No.: 
Filing Date: 

Title: NOVEL ANTIGEN CONSTRUCTS 
USEFUL IN THE DETECTION AND 
DIFFERENTIATION OF ANTIBODIES 
TO HIV 

Examiner: 

Group Art Unit: 

Atty Docket No.: 6165.US.01 
Today's Date: August 15, 1997 



NONE 



Claim to benefit of earlier U.S. application(s): 

I hereby claim the benefit under 35 U.S.C. §120 of the following earlier- 
filed United States patent application(s): 

NONE 

Insofar as the subject matter of each of the claims of this/these application(s) is not 
disclosed in the prior U.S. applications in the manner required by 35 U.S.C. §112, first 
paragraph, I acknowledge a duty to disclose to the Patent and Trademark Office all 
information known to me to be material to patentability as defined in 37 C.F.R. which came 
into existence between the filing date(s) of the prior applications and the national or PCT 
filing date of this application. 



I hereby appoint the following Attorneys and/or agents to prosecute this application and any 
continuation or divisional applications based hereon, and to transact all business in the Patent 
and Trademark Office connected therewith: 



Mona Anand, Reg. No. 34,537 
Regina M. Anderson, Reg. No. 35,820 
Mark C. Bach, Reg. No. 34,766 
Cheryl L Becker, Reg. No. 35,441 
Thomas D. Brainard, Reg. No. 32,459 
Thomas M. Breininger, Reg. No. 29,897 
Dianne Casuto, Reg. P-40,943 
Steven R. Crowley, Reg. No. 31,604 
Andreas M. Danckers, Reg. No. 32,652 
Richard A. Elder, Reg. No. 30,255 
Matthew R. Hooper, Reg. No. 31,108 



James D. McNeil, Reg. No. 26,204 
Neal D. Marcus, Reg. 35,267 
Lawrence S. Pope, Reg. No. 26,791 
Priscilla E. Porembski, Reg. No. 33,207 
Nicholas A. Poulos, Reg. No. 30,209 
Gregory W. Steele, Reg. No. 33,796 
Michael J. Ward, Reg. No. 37,960 
David L Weinstein, Reg. No. 28,128 
Steven F. Weinstock, Reg. No. 30,117 
Brian R. Woodworth, Reg. No. 33,137 
Frank Z. Yang, Reg. No. 35,417 
Paul D. Yasger, Reg. No. 37,477 



Send Correspondence to: Steven F. Weinstock 

Abbott Laboratories 



D-377/AP6D 

100 Abbott Park Road 

Abbott Park, Illinois 60064-3500 



Direct telephone calls to: 



Andreas M. Danckers 
(847)937-9803 



Name: (first, middle, last): 
Post Office Address: 
Residence: 

Citizenship: 



JOHN R. HACKETT, JR. 
Libertyville, Illinois 
306 Merrill Court 
Libertyville, Illinois 60048 
United States of America 



Name: (first, middle, last): 
Post Office Address: 
Residence: 

Citizenship: 



JULIE YAMAGUCHI 
Chicago, Illinois 
3034 W. Jerome Street 
Chicago, Illinois 60645 
United States of America 



Name: (first, middle, last): 
Post Office Address: 
Residence: 



Citizenship: 



ALANM. GOLDEN 
Wilmette, Illinois 
2516 Laurel Lane 
Wilmette, Illinois 60091 
United States of America 



Name: (first, middle, last): 
Post Office Address: 
Residence: 

Citizenship: 



CATHERINE A. BRENNAN 
Libertyville, Illinois 
634 East Lincoln Avenue 
Libertyville, Illinois 60048 
United States of America 



Name: (first, middle, last): 
Post Office Address: 
Residence: 

Citizenship: 

Name: (first, middle, last): 
Post Office Address: 
Residence: 

Citizenship: 



ROBERT K. HICKMAN 
Mundelein, Illinois 
524 South Prairie Avenue 
Mundelein, Illinois 60060 
United States of America 

SUSHILG. DEVARE 
Northbrook, Illinois 
2492 Farnsworth Lane 
Northbrook, Illinois 60062 
United States of America 



I further declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that all 
statements made herein were made with the knowledge that willful false statements and the like 



3 



so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code, and that such willful false statements may jeopardize the validity of the 
application or any patent issuing thereon. 



JOHN R. HACKETT, JR. 


HATC 


II II If \/Al JAOI |/*\ 1 1 1 

JULIE YAMAGUCHI 


nATC 
LJAI t 


Al AM M fSOl DFN 
ML/MN IVI. UULULIN 


DATE 


CATHERINE A. BRENNAN 


DATE 


ROBERT K. HICKMAN 


DATE 



SUSHILG. DEVARE DATE 



4 



